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Preface to the First Edition 


This textbook covers the basic properties of elliptic curves and modular 
forms, with emphasis on certain connections with number theory. The ancient 
“congruent number problem” is the central motivating example for most of 
the book. 

My purpose is to make the subject accessible to those who find it hard to 
read more advanced or more algebraically oriented treatments. At the same 
time I want to introduce topics which are at the forefront of current research. 
Down-to-earth examples are given in the text and exercises, with the aim of 
making the material readable and interesting to mathematicians in fields far 
removed from the subject of the book. 

With numerous exercises (and answers) included, the textbook is also 
intended for graduate students who have completed the standard first-year 
courses in real and complex analysis and algebra. Such students would learn 
applications of techniques from those courses, thereby solidifying their under- 
standing of some basic tools used throughout mathematics. Graduate stu- 
dents wanting to work in number theory or algebraic geometry would get a 
motivational, example-oriented introduction. In addition, advanced under- 
graduates could use the book for independent study projects, senior theses, 
and seminar work. 

This book grew out of lecture notes for a course I gave at the University of 
Washington in 1981-1982, and from a series of lectures at the Hanoi 
Mathematical Institute in April, 1983. I would like to thank the auditors of 
both courses for their interest and suggestions. My special gratitude is due to 
Gary Nelson for his thorough reading of the manuscript and his detailed 
comments and corrections. I would also like to thank Professors J. Buhler, B. 
Mazur, B. H. Gross, and Huynh Mui for their interest, advice and 
encouragement. 


vi Preface to the First Edition 


The frontispiece was drawn by Professor A. T. Fomenko of Moscow State 
University to illustrate the theme of this book. It depicts the family of elliptic 
curves (tori) that arises in the congruent number problem. The elliptic curve 
corresponding to a natural number »# has branch points at 0, 00, n and —n. In 
the drawing we see how the elliptic curves interlock and deform as the branch 
points +n go to infinity. 


Note: References are given in the form [Author year]; in case of multiple 


works by the same author in the same year, we use a, b, ... after the date to 
indicate the order in which they are listed in the Bibliography. 


Seattle, Washington NEAL KOBLITZ 


Preface to the Second Edition 


The decade since the appearance of the first edition has seen some major 
progress in the resolution of outstanding theoretical questions concerning 
elliptic curves. The most dramatic of these developments have been in the 
direction of proving the Birch and Swinnerton-Dyer conjecture. Thus, one 
of the changes in the second edition is to update the bibliography and the 
discussions of the current state of knowledge of elliptic curves. 

It was also during the 1980s that, for the first time, several important 
practical applications were found for elliptic curves. In the first place, the 
algebraic geometry of elliptic curves (and other algebraic curves, especially 
the curves that parametrize modular forms) were found to provide a source 
of new error-correcting codes which sometimes are better in certain respects 
than all previously known ones (see [van Lint 1988]). In the second place, 
H.W. Lenstra’s unexpected discovery of an improved method of factoring 
integers based on elliptic curves over finite fields (see [Lenstra 1987]) led to a 
sudden interest in elliptic curves among researchers in cryptography. Further 
cryptographic applications arose as the groups of elliptic curves were used as 
the “site” of so-called “public key” encryption and key exchange schemes (see 
[Koblitz 1987], [Miller 1986], [Menezes and Vanstone 1990]). 

Thus, to a much greater extent than I would have expected when I wrote this 
book, readers of the first edition came from applied areas of the mathematical 
sciences as well as the more traditional fields for the study of elliptic curves, 
such as algebraic geometry and algebraic number theory. 

I would like to thank the many readers who suggested corrections and 
improvements that have been incorporated into the second edition. 


Contents 


Preface to the First Edition v 
Preface to the Second Edition vii 
CHAPTER I 

From Congruent Numbers to Elliptic Curves 1 
1. Congruent numbers 3 
2. A certain cubic equation 6 
3. Elliptic curves 9 
4. Doubly periodic functions 14 
5. The field of elliptic functions 18 
6. Elliptic curves in Weierstrass form 22 
7. The addition law 29 
8. Points of finite order 36 
9. Points over finite fields, and the congruent number problem 43 
CHAPTER II 

The Hasse—Weil L-Function of an Elliptic Curve 51 
1. The congruence zeta-function 51 
2. The zeta-function of E£, 56 
3. Varying the prime p 64 
4. The prototype: the Riemann zeta-function 70 
5. The Hasse—Weil L-function and its functional equation 79 
6. The critical value 90 


x Contents 


CHAPTER ITI 


Modular forms 98 
1. SL,(Z) and its congruence subgroups 98 
2. Modular forms for SL3(Z) 108 
3. Modular forms for congruence subgroups 124 
4. Transformation formula for the theta-function 147 
5. The modular interpretation, and Hecke operators 153 
CHAPTER IV 
Modular Forms of Half Integer Weight 176 
1. Definitions and examples 177 
2. Eisenstein series of half integer weight for I'o(4) 185 
3. Hecke operators on forms of half integer weight 202 
4. The theorems of Shimura, Waldspurger, Tunnell, and the congruent 

number problem 212 
Answers, Hints, and References for Selected Exercises 223 
Bibliography 240 


Index 245 


CHAPTER I 


From Congruent Numbers to Elliptic 
Curves 


The theory of elliptic curves and modular forms is one subject where the 
most diverse branches of mathematics come together: complex analysis, 
algebraic geometry, representation theory, number theory. While our point 
of view will be number theoretic, we shall find ourselves using the type of 
techniques that one learns in basic courses in complex variables, real var- 
iables, and algebra. A well-known feature of number theory is the abundance 
of conjectures and theorems whose statements are accessible to high school 
students but whose proofs either are unknown or, in some cases, are the 
culmination of decades of research using some of the most powerful tools 
of twentieth century mathematics. 

We shall motivate our choice of topics by one such theorem: an elegant 
characterization of so-called “congruent numbers” that was proved by J. 
Tunnell [Tunnell 1983]. A few of the proofs of necessary results go beyond 
our scope, but many of the ingredients in the proof of Tunnell’s theorem will 
be developed in complete detail. 

Tunnell’s theorem gives an almost complete answer to an ancient problem: 
find a simple test to determine whether or not a given integer n is the area 
of some right triangle all of whose sides are rational numbers. A natural 
number v is called “congruent” if there exists a right triangle with all three 
sides rational and area n. For example, 6 is the area of the 3—4—5 right 
triangle, and so is a congruent number. 

Right triangles whose sides are integers X, Y, Z (a ‘‘Pythagorean triple’’) 
were studied in ancient Greece by Pythagoras, Euclid, Diophantus, and 
others. Their central discovery was that there is an easy way to generate all 
such triangles. Namely, take any two positive integers a and b with a > 5, 
draw the line in the wv-plane through the point (— 1, 0) with slope b/a. Let 
(u, v) be the second point of intersection of this line with the unit circle 
(see Fig. I.1). It is not hard to show that 
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(u,v) 


Figure I.1 


_a@—b? pa ab 
~ a? +b?’ ~ a? +b?" 


Then the integers ¥ = a? — b?, Y= 2ab, Z =a? + b? are the sides of a 
right triangle; the fact that X? + Y? = Z? follows because u? + v? = 1. By 
letting a and b range through all positive integers with a > 5, one gets all 
possible Pythagorean triples (see Problem | below). 

Although the problem of studying numbers which occur as areas of 
rational right triangles was of interest to the Greeks in special cases, it 
seems that the congruent number problem was first discussed systematically 
by Arab scholars of the tenth century. (For a detailed history of the problem 
of determining which numbers are “congruent”, see [L. E. Dickson 1952, 
Ch. XVI]; see also [Guy 1981, Section D27].) The Arab investigators 
preferred to rephrase the problem in the following equivalent form: given n, 
can one find a rational number x such that x? +” and x? —n are both 
squares of rational numbers? (The equivalence of these two forms of the 
congruent number problem was known to the Greeks and to the Arabs; for 
a proof of this elementary fact, see Proposition 1 below.) 

Since that time, some well-known mathematicians have devoted consid- 
erable energy to special cases of the congruent number problem. For 
example, Euler was the first to show that n =7 is a congruent number. 
Fermat showed that n= 1 is not; this result is essentially equivalent to 
Fermat’s Last Theorem for the exponent 4 (i.e., the fact that ¥* + Y* = Z* 
has no nontrivial integer solutions). 

It eventually became known that the numbers 1, 2, 3, 4 are not congruent 
numbers, but 5, 6, 7 are. However, it looked hopeless to find a straight- 
forward criterion to tell whether or not a given n is congruent. A major 
advance in the twentieth century was to place this problem in the context of 
the arithmetic theory of elliptic curves. It was in this context that Tunnell 
was able to prove his remarkable theorem. 


u 
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Here is part of what Tunnell’s theorem says (the full statement will be 
given later): 


Theorem (Tunnell). Let n be an odd squarefree natural number. Consider the 
two conditions: 


(A) n is congruent; 
(B) the number of triples of integers (x, y, z) satisfying 2x? + y? + 8z*=n 
is equal to twice the number of triples satisfying 2x? + y? + 32z? =n. 


Then (A) implies (B); and, if a weak form of the so-called Birch—Swinnerton- 
Dyer conjecture is true, then (B) also implies (A). 


The central concepts in the proof of Tunnell’s theorem—the Hasse—Weil 
L-function of an elliptic curve, the Birch-Swinnerton-Dyer conjecture, 
modular forms of half integer weight—will be discussed in later chapters. 
Our concern in this chapter will be to establish the connection between 
congruent numbers and a certain family of elliptic curves, in the process 
giving the definition and some basic properties of elliptic curves. 


§1. Congruent numbers 


Let us first make a more general definition of a congruent number. A 
positive rational number re @ is called a “‘congruent number” if it is the 
area of some right triangle with rational sides. Suppose r is congruent, and 
X, Y, Ze Q are the sides of a triangle with area r. For any nonzero re @ we 
can find some se @ such that s?r is a squarefree integer. But the triangle 
with sides s¥, sY, sZ has area s*r. Thus, without loss of generality we may 
assume that r=n is a squarefree natural number. Expressed in group 
language, we can say that whether or not a number r in the multiplicative 
group Q* of positive rational numbers has the congruent property depends 
only on its coset modulo the subgroup (Q*)? consisting of the squares of 
rational numbers; and each coset in Q*/(Q*)? contains a unique squarefree 
natural number r = n. In what follows, when speaking of congruent numbers, 
we shall always assume that the number is a squarefree positive integer. 

Notice that the definition of a congruent number does not require the 
sides of the triangle to be integral, only rational. While n = 6 is the smallest 
possible area of a right triangle with integer sides, one can find right triangles 
with rational sides having area n = 5. The right triangle with sides 14, 62, 62 
is such a triangle (see Fig. I.2). It turns out that n = 5 is the smallest congruent 
number (recall that we are using “‘congruent number” to mean “‘congruent 
squarefree natural number’’). 

There is a simple algorithm using Pythagorean triples (see the problems 
below) that will eventually list all congruent numbers. Unfortunately, given 
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vir 


2 
65 


Figure 1.2 


n, one cannot tell how long one must wait to get n if it is congruent; thus, 
if n has not appeared we do not know whether this means that n is not a 
congruent number or that we have simply not waited long enough. From a 
practical point of view, the beauty of Tunnell’s theorem is that his condition 
(B) can be easily and rapidly verified by an effective algorithm. Thus, his 
theorem almost settles the congruent number problem, i.e., the problem of 
finding a verifiable criterion for whether a given n is congruent. We must 
say ‘‘almost settles” because in one direction the criterion is only known to 
work in all cases if one assumes a conjecture about elliptic curves. 

Now suppose that X, Y, Z are the sides of a right triangle with area n. 
This means: X¥? + Y? = Z?, and XY =n. Thus, algebraically speaking, 
the condition that n be a congruent number says that these two equations 
have a simultaneous solution X, Y, ZeQ. In the proposition that follows, 
we derive an alternate condition for n to be a congruent number. In listing 
triangles with sides X, Y, Z, we shall not want to list X, Y, Z and Y, X, Z 
separately. So for now let us fix the ordering by requiring that ¥ << Y<Z 
(Z is the hypotenuse). 


Proposition 1. Let n be a fixed squarefree positive integer. Let X, Y, Z, x always 
denote positive rational numbers, with X < Y < Z. There is a one-to-one 
correspondence between right triangles with legs X and Y, hypotenuse Z, and 
area n, and numbers x for which x, x +n, and x — n are each the square of a 
rational number. The correspondence is: 


X, Y, Z>x =(Z/2)? 


x7X=/xtn-—VJ/x-—n, Y= V/xt+nt+ /x—-a4, Bla x. 


In particular, n is a congruent number if and only if there exists x such that x, 
x +n, and x — nare squares of rational numbers. 


Proor. First suppose that X, Y, Z is a triple with the desired properties: 
X? 4 Y*? = Z?,4XY =n. If we add or subtract four times the second equa- 
tion from the first, we obtain: (¥ + Y)? = Z? + 4n. If we then divide both 
sides by four, we see that x = (Z/2)* has the property that the numbers 
x +n are the squares of (¥ + Y)/2. Conversely, given x with the desired 
properties, it is easy to see that the three positive rational numbers ¥ < Y< Z 
given by the formulas in the proposition satisfy: XY = 2n, and X? + Y* = 
4x = Z*. Finally, to establish the one-to-one correspondence, it only remains 
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to verify that no two distinct triples X, Y, Z can lead to the same x. We leave 
this to the reader (see the problems below). oO 


PROBLEMS 


1. Recall that a Pythagorean triple is a solution (X, Y, Z) in positive integers to the 
equation X? + Y? = Z?. It is called “primitive” if ¥, Y, Z have no common factor. 
Suppose that a > 6 are two relatively prime positive integers, not both odd. Show 
that ¥ = a? — b?, Y= 2ab, Z=a? + 5? form a primitive Pythagorean triple, and 
that all primitive Pythagorean triples are obtained in this way. 


2. Use Problem | to write a flowchart for an algorithm that lists all squarefree con- 
gruent numbers (of course, not in increasing order). List the first twelve distinct 
congruent numbers your algorithm gives. Note that there is no way of knowing 
when a given congruent number vn will appear in the list. For example, 101 is a 
congruent number, but the first Pythagorean triple which leads to an area s? 101 
involves twenty-two-digit numbers (see [Guy 1981, p. 106]). One hundred fifty-seven 
is even worse (see Fig. I.3). One cannot use this algorithm to establish that some n 
is not a congruent number. Technically, it is not a real algorithm, only a “semi- 
algorithm”’. 


3. (a) Show that if 1 were a congruent number, then the equation x* — y* = u? would 
have an integer solution with u odd. 
(b) Prove that 1 is not a congruent number. (Note: A consequence is Fermat’s 
Last Theorem for the exponent 4.) 


4. Finish the proof of Proposition 1 by showing that no two triples X, Y, Z can lead 
to the same x. 


5. (a) Find xe(Q*)? such that x + Se(Q*)?. 
(b) Find xe(Q*)? such that x + 6€(Q*). 


22440351 77043369699245575 130906748631 60948472041 
8912332268928859588025 535 1 7896716357001 6480830 


6803298487826435051 217540 
411340519227716149383203 


411340519227716149383203 
2166655569371 4761309610 


Figure I.3. The Simplest Rational Right Triangle with Area 157 (computed by D. 
Zagier). 
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(c) Find two values x ¢(@*)* such that x + 210€(Q*)*. At the end of this chapter 
we shall prove that if there is one such x, then there are infinitely many. Equiva- 
lently (by Proposition 1), if there exists one right triangle with rational sides 
and area n, then there exist infinitely many. 


6. (a) Show that condition (B) in Tunnell’s theorem is equivalent to the condition that 
the number of ways n can be written in the form 2x? + y? + 8z? with x, y, z 
integers and z odd, be equal to the number of ways n can be written in this form 
with z even. 

(b) Write a flowchart for an algorithm that tests condition (B) in Tunnell’s theorem 
for a given n. 


7. (a) Prove that condition (B) in Tunnell’s theorem always holds if ” is congruent 
to 5 or 7 modulo 8. 

(b) Check condition (B) for all squarefree n = 1 or 3 (mod 8) until you find such 
an n for which condition (B) holds. 

(c) By Tunnell’s theorem, the number you found in part (b) should be the smallest 
congruent number congruent to | or 3 modulo 8. Use the algorithm in Problem 2 
to find a right triangle with rational sides and area equal to the number you 
found in part (b). 


§2. A certain cubic equation 


In this section we find yet another equivalent characterization of congruent 
numbers. 

In the proof of Proposition | in the last section, we arrived at the equations 
((X + Y)/2)? = (Z/2)? +n whenever X, Y, Z are the sides of a triangle with 
area n. If we multiply together these two equations, we obtain ((X? — Y7)/4)? 
= (Z/2)* — n?. This shows that the equation u* — n? = v? has a rational 
solution, namely, u = Z/2 and v = (X? — Y?)/4. We next multiply through 
by wu? to obtain u® — n?u? = (uv). If we set x = u? = (Z/2)? (this is the same 
x as in Proposition 1) and further set y = uv = (X? — Y7)Z/8, then we have 
a pair of rational numbers (x, y) satisfying the cubic equation: 


yp=xe—n’x. 


Thus, given a right triangle with rational sides X, Y, Z and area n, we 
obtain a point (x, y) in the xy-plane having rational coordinates and lying 
on the curve y? = x? — n?x. Conversely, can we say that any point (x, y) 
with x, ye Q which lies on the cubic curve must necessarily come from such 
a right triangle? Obviously not, because in the first place the x-coordinate 
x =u? =(Z/2)* must lie in (Q*)? if the point (x, y) can be obtained as in 
the last paragraph. In the second place, we can see that the x-coordinate of 
such a point must have its denominator divisible by 2. To see this, notice that 
the triangle X, Y, Z can be obtained starting with a primitive Pythagorean 
triple X’, Y’, Z’ corresponding to a right triangle with integral sides X’, Y’, Z’ 
and area sn, and then dividing the sides by s to get X, Y, Z. But in a primitive 
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Pythagorean triple X’ and Y’ have different parity, and Z’ is odd. We 
conclude that (1) x = (Z/2)? = (Z’/2s)” has denominator divisible by 2 and 
(2) the power of 2 dividing the denominator of Z is equal to the power of 2 
dividing the denominator of one of the other two sides, while a strictly lower 
power of 2 divides the denominator of the third side. (For example, in the 
triangle in Fig. I.2 with area 5, the hypotenuse and the shorter side have a 2 in 
the denominator, while the other leg does not.) We conclude that a necessary 
condition for the point (x, y) with rational coordinates on the curve y? = 
x* —n*x to come from a right triangle is that x be a square and that its 
denominator be divisible by 2. For example, when n = 31, the point (417/77, 
29520/7*) on the curve y? = x — 31?x does not come from a triangle, even 
though its x-coordinate is a square. 

Finally, a third necessary condition is that the numerator of x have no 
common factor with n. To see this, suppose that p > 2 is a prime dividing 
both n and the numerator of x. Then p divides the numerator of x tn = 
((X + Y)/2)’, and so it also divides the numerators of (X + Y)/2 and 
(X — Y)/2. Then p divides the numerators of the sum X and the difference Y. 
Hence p? divides n = 3X Y. But n was assumed to be squarefree. This contra- 
diction shows that x must be a square with even denominator and numerator 
prime to n. A numerical example (for which I thank Clas Léfwall) showing 
that the first two conditions alone are not sufficient is provided by the point 
(x, y) = (25/4, 75/8) on the curve y? = x? — n?x,n = 5S. 

We next prove that these three conditions are not only necessary but also 
sufficient for a point on the curve to come from a triangle. 


Proposition 2. Let (x, y) be a point with rational coordinates on the curve 
y* = x* — nx. Suppose that x satisfies the three conditions: (i) it is the square 
of a rational number, (ii) its denominator is even, and (iii) its numerator has no 
common factor with n. Then there exists a right triangle with rational sides and 
area n which corresponds to x under the correspondence in Proposition 1. 


Proor. Let u = ae € Q*. We work backwards through the sequence of steps 
at the beginning of this section. That is, set v = y/u, so that v? = y?/x = 
x? — n?, ie. v? + n? = x?. Now let t be the denominator of u, i.e., the smallest 
positive integer such that tue Z. By assumption, t is even. Notice that the 
denominators of v? and x? are the same (because n is an integer, and v? + n? = 
x*), and this denominator is t*. Thus, tv, t?n, t?x is a primitive Pythagorean 
triple, with tn even. (Here the primitivity of the triple follows from condition 
(iii).) By Problem 1 of §1, there exist integers a and b such that: t?n = 2ab, 
t*v = a’ — b’, t?x = a? + b?. Then the right triangle with sides 2a/t, 2b/t, 2u 
has area 2ab/t? = n, as desired. The image of this triangle X = 2a/t, Y = 2b/t, 
Z = 2u under the correspondence in Proposition 1 is x =: (Z/2)? = u?. This 
proves Proposition 2. oO 


We shall later prove another characterization of the points P = (x, y) on 
the curve y* = x* — nx which correspond to rational right triangles of 
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Figure 1.4 


area n. Namely, they are the points P = (x, y) which are “twice” a rational 
point P’ = (x’, y’). That is, P’ + P’ = P, where “+” is an addition law for 
points on our curve, which we shall define later. 


PROBLEMS 


1. Find a simple linear change of variables that gives a one-to-one correspondence 
between points on ny? =x>+ax?+bx+c and points on y? =x? + anx? + 
bn? x + cn>. For example, an alternate form of the equation y? = x* — n?x is the 
equation ny? = x> — x, 


2. Another correspondence between rational right triangles X, Y, Z with area XY =n 
and rational solutions to y? = x> — n?x can be constructed as follows. 


(a) 


(b) 
(c) 


(d) 


€) 


Parametrize all right triangles by letting the point u = X/Z, v = Y/Z on the unit 
circle correspond to the slope ¢ of the line joining (—1, 0) to this point (see 
Fig. I.4). Show that 

1-2? 2t 

—, v= : 

l+2 1+2 


(Note: This is the usual way to parametrize a conic. If t = a/b is rational, then 
the point (u, v) corresponds to the Pythagorean triple constructed by the method 
at the beginning of the chapter.) 

If we want the triangle X, Y, Z to have area n, express n/Z? in terms of t. 
Show that the point x = —nt, y =n?(1 + #7)/Z is on the curve y? = x3 — n? x. 
Express (x, y) in terms of X, Y, Z. 

Conversely, show that any point (x, y) on the curve y? = x — n*x with y #0 
comes from a triangle, except that to get points with positive x, we must allow 
triangles with negative X and Y (but positive area 4X Y = n), and to get points 
with negative y we must allow negative Z (see Fig. 1.5). Later in this chapter we 
shall show the connection between this correspondence and the one given in the 
text above. 

Find the points on y? = x? — 36x coming from the 3—4—5 right triangle and all 
equivalent triangles (4~3—5, (— 3)-(— 4)-5, etc.). 


3. Generalize the congruent number problem as follows. Fix an angle @ not necessarily 
90°. But suppose that 4 = cos 6 and B = sin @ are both rational. Let n be a square- 
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X,Y <0,Z>0 


X,¥,Z<0 
X,Y >0,Z<0 


Figure I.5 


free natural number. One can then ask whether n is the area of any triangle with 

rational sides one of whose angles is 6. 

(a) Show that the answer to this question is equivalent to a question about rational 
solutions to a certain cubic equation (whose coefficients depend on @ as well 
as n). 

(b) Suppose that the line joining the point (—1, 0) to the point (A, B) on the unit 
circle has slope A. Show that the cubic in part (a) is equivalent (by a linear 
change of variables) to the cubic ny? = x(x — A)(x + (1/A)). The classical con- 
gruent number problem is, of course, the case 1 = 1. 


§3. Elliptic curves 


The locus of points P = (x, y) satisfying y? = x — n?x is a special case of 
what’s called an “elliptic curve’’. More generally, let K be any field, and let 
J(x)€ K[x] be a cubic polynomial with coefficients in K which has distinct 
roots (perhaps in some extension of K). We shall suppose that K does not 
have characteristic 2. Then the solutions to the equation 


y? = f(x), (3.1) 


where x and y are in some extension K’ of K, are called the K’-points of the 
elliptic curve defined by (3.1). We have just been dealing with the example 
K=K’=Q, f(x)=x*>—n’x. Note that this example y? = x*> —n?x 
satisfies the condition for an elliptic curve over any field K of characteristic 
p, as long as p does not divide 2n, since the three roots 0, +n of f(x) = x3 — 
n*x are then distinct. 

In general, if x9, yy)¢K’ are the coordinates of a point on a curve C 
defined by an equation F(x, y) = 0, we say that C is “‘smooth”’ at (x9, yo) if 
the two partial derivatives 0F/0x and OF/dy are not both zero at (XQ, yo). 


10 I. From Congruent Numbers to Elliptic Curves 


This is the definition regardless of the ground field (the partial derivative 
of a polynomial F(x, y) is defined by the usual formula, which makes sense 
over any field). If K’ is the field R of real numbers, this agrees with the usual 
condition for C to have a tangent line. In the case F(x, y) = y? — f(x), the 
partial derivatives are 2yy and —f’(x9). Since K’ is nota field of characteristic 
2, these vanish simultaneously if and only if yp = 0 and x, is a multiple root 
of f(x). Thus, the curve has a non-smooth point if and only if f(x) has a 
multiple root. It is for this reason that we assumed distinct roots in the 
definition of an elliptic curve: an elliptic curve is smooth at all of its points. 

In addition to the points (x, y) on an elliptic curve (3.1), there is a very 
important “‘point at infinity” that we would like to consider as being on 
the curve, much as in complex variable theory in addition to the points on 
the complex plane one throws in a point at infinity, thereby forming the 
“Riemann sphere’. To do this precisely, we now introduce projective 
coordinates. 

By the “total degree” of a monomial x'y/ we mean i+ j. By the “total 
degree” of a polynomial F(x, y) we mean the maximum total degree of the 
monomials that occur with nonzero coefficients. If F(x, y) has total degree 
n, we define the corresponding homogeneous polynomial F(x, y, z) of three 
variables to be what you get by multiplying each monomial x'y/ in F(x, y) 
by 2" '/ to bring its total degree in the variables x, y, z up to ; in other 


words, 
F(x, y, 2) = (2, ‘), 
Zee, 


In our example F(x, y) = y? — (x3 — n?x), we have F(x, y, z) = y?z — x3 + 
n?xz?, Notice that F(x, y) = F(x, y, 1). 

Suppose that our polynomials have coefficients in a field K, and we are 
interested in triples x, y, z¢ K such that F(x, y, z) = 0. Notice that: 


(1) for any 2eEK, F(Ax, dy, 4z) = j"F(x, y, z) (n = total degree of F); 
(2) for any nonzero 4€ K, F(Ax, Ay, Az) = 0 if and only if F(x, y, z) = 0. In 
particular, for z 4 0 we have F(x, y, z) = 0 if and only if F(x/z, y/z) = 0. 


Because of (2), it is natural to look at equivalence classes of triples x, y, 
ze K, where we say that two triples (x, y, z) and (x’, y’, 2’) are equivalent if 
there exists a nonzero A€K such that (x’, )”, 2’) = 4(x, y, z). We omit the 
trivial triple (0, 0, 0), and then we define the “‘projective plane Px” to be 
the set of all equivalence classes of nontrivial triples. 

No normal person likes to think in terms of ‘‘equivalence classes”’, and 
fortunately there are more visual ways to think of the projective plane. 
Suppose that K is the field R of real numbers. Then the triples (x, y, z) in 
an equivalence class all correspond to points in three-dimensional Euclidean 
space lying on a line through the origin. Thus, PZ can be thought of geo- 
metrically as the set of lines through the origin in three-dimensional space. 

Another way to visualize Pj is to place a plane at a distance from the 
origin in three-dimensional space, for example, take the plane parallel to the 
xy-plane and at a distance | from it, i-e., the plane with equation z = 1. All 
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lines through the origin, except for those lying in the xy-plane, have a unique 
point of intersection with this plane. That is, every equivalence class of 
triples (x, y, z) with nonzero z-coordinate has a unique triple of the form 
(x, y, 1). So we think of such equivalence classes as points in the ordinary 
xy-plane. The remaining triples, those of the form (x, y, 0), make up the 
“line at infinity”’. 

The line at infinity, in turn, can be visualized as an ordinary line (say, 
the line y= 1 in the xy-plane) consisting of the equivalence classes with 
nonzero y-coordinate and hence containing a unique triple of the form 
(x, 1, 0), together with a single “‘point at infinity”’ (1, 0, 0). That is, we define 
the projective line Pj over a field K to be the set of equivalence classes of 
pairs (x, y) with (x, y) ~ (Ax, Ay). Then PZ can be thought of as an ordinary 
plane (x, y, 1) together with a projective line at infinity, which, in turn, 
consists of an ordinary line (x, 1, 0) together with its point at infinity (1, 0, 0). 

More generally, n-dimensional projective space Px is defined using 
equivalence classes of (7 + 1)-tuples, and can be visualized as the usual 
space of n-tuples (x,, ...,X,, 1) together with a PZ’ at infinity. But we 
shall only have need of P} and P?. 

Given a homogeneous polynomial F(x, y, z) with coefficients in K, we 
can look at the solution set consisting of points (x, y, z) in Pz (actually, 
equivalence classes of (x, y, z)) for which F(x, y, z) = 0. The points of this 
solution set where z #0 are the points (x, y, 1) for which F(x, y, 1) = 
F(x, y) = 0. The remaining points are on the line at infinity. The solution 
set of F(x, y,z)=0 is called the “projective completion’’ of the curve 
F(x, y) = 0. From now on, when we speak of a ‘“‘line’’, a ‘‘conic section”’, 
an “elliptic curve’’, etc., we shall usually be working in a projective plane 
Px, in which case these terms will always denote the projective completion 
of the usual curve in the xy-plane. For example, the line y = mx + 5 will 
really mean the solution set to y = mx + bz in Pg; and the elliptic curve 
y? = x — n?x will now mean the solution set to y?2z = x? — n?xz? in P2. 

Let us look more closely at our favorite example: F(x, y) = y? — x3 + n?x, 
F(x, y, 2) =y?z — x3 + n?xz?. The points at infinity on this elliptic curve 
are the equivalence classes (x, y, 0) such that 0 = F(x, y, 0) = —x?. ie., 
x = 0. There is only one such equivalence class (0, 1, 0). Intuitively, if we 
take K = R, we can think of the curve y* = x? — n?x heading off in an 
increasingly vertical direction as it approaches the line at infinity (see Fig. 
I.6). The points on the line at infinity correspond to the lines through the 
origin in the xy-plane, i.e., there is one for every possible slope y/x of such 
a line. As we move far out along our elliptic curve, we approach slope 
y/x = &, corresponding to the single point (0, 1, 0) on the line at infinity. 
Notice that any elliptic curve y* = f(x) similarly contains exactly one point 
at infinity (0, 1, 0). 

All of the usual concepts of calculus on curves F(x, y) = 0 in the xy-plane 
carry over to the corresponding projective curve F(x, y, z) = 0. Such notions 
as the tangent line at a point, points of inflection, smooth and singular 
points all depend only upon what is happening in a neighborhood of the 
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Figure I.6 


point in question. And any point in P2 has a large neighborhood which 
looks like an ordinary plane. More precisely, if we are interested in a point 
with nonzero z-coordinate, we can work in the usual xy-plane, where the 
curve has equation F(x, y) = F(x, y, 1) = 0. If we want to examine a point 
on the line z = 0, however, we put the triple in either the form (x, 1, 0) or 
(1, y, 0). In the former case, we think of it as a point on the curve F(x, 1, z) = 0 
in the xz-plane; and in the latter case as a point on the curve F(/, y, z) = 0 
in the yz-plane. 

For example, near the point at infinity (0, 1,0) on the elliptic curve 
y?z — x3 + n?xz? = 0,all points have the form (x, 1, z) withz — x? + n?xz? = 
0. The latter equation, in fact, gives us all points on the elliptic curve except 
for the three points (0, 0, 1), (n, 0, 1) having zero y-coordinate (these are 
the three ‘‘points at infinity” if we think in terms of xz-coordinates). 


PROBLEMS 


1. Prove that if K is an infinite field and F(x, y, z)€ K(x, y, z] satisfies F(x, Ay, Az) = 
2" F(x, y, Z) for all 4, x, y, zeK, then F is homogeneous, i.e., each monomial has 
total degree n. Give a counterexample if K is finite. 


2. By a “line” in P? we mean either the projective completion of a line in the xy-plane 
or the line at infinity. Show that a line in Pz has equation of the form ax + by + cz = 
0, with a, b, ce K not all zero; and that two such equations determine the same line 
if and only if the two triples (a, b, c) differ by a multiple. Construct a 1-to-1 cor- 
respondence between lines in a copy of PZ with coordinates (x, y, z) and points in 
another copy of Pz with coordinates (a, b, c) and between points in the xyz-projec- 
tive plane and lines in the abc-projective plane, such that a bunch of points are on 
the same line in the first projective plane if and only if the lines that correspond to 
them in the second projective plane all meet in the same point. The xyz-projective 
plane and the abc-projective plane are called the “‘duals’’ of each other. 
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3. How many points at infinity are on a parabola in P?? an ellipse? a hyperbola? 


4. Prove that any two nondegenerate conic sections in P are equivalent to one another 
by some linear change of variables. 


5. (a) 


(b) 


(c) 


(d 


~ 


(e 


~a 


6. (a) 


(b 


YS 


(c) 


If F(x, y, z)€K[x, y, z] is homogeneous of degree n, show that 
or of + Pins = nF. 

dz 
If K has characteristic zero, show that a point (x, y, z)¢PZ is a non-smooth 
point on the curve C: F(x, y, z)=0 if and only if the triple (0F/@x, dF /dy, 
dF/€z) is (0, 0, 0) at our particular (x, y, z). Give a counterexample if char K # 0. 
In what follows, suppose that char K = 0, e.g., K=R. 
Show that the tangent line to C at a smooth point (Xo, Yo, Zo) has equation 
ax + by + cz = 0, where 


_oF 


oF _ oF 
~ Ox 


fase c 


a > aan ae > on 
(Xs Yor Zo) dy (Xo. Yor Zo) dz (X95 Yo» Zo) 


Prove that the condition that (x, y, z) be a smooth point on C does not depend 
upon the choice of coordinates, i.e., it does not change if we shift to x’y’z’- 
coordinates, where (x’ y’ z’)=(x y z)A with A an invertible 3 x 3 matrix. 
For example, if more than one of the coordinates are nonzero, it makes no 
difference which we choose to regard as the “‘z-coordinate’’, i.e., whether we 
look at C in the xy-plane, the xz-plane, or the yz-plane. 

Prove that the condition that a given line / be tangent to C at a smooth point 
(x, y, Z) does not depend upon the choice of coordinates. 


Let P, = (x,,y,,2,) and P, = (x2, y2, 22) be two distinct points in P?. Show 
that the line joining P, and P, can be given in parametrized form as sP, + tP,, 
i.e., {(sx, + 1x, Sy, + ty, $z, + tz2)|s, te K}. Check that this linear map takes 
Px (with coordinates s, f) bijectively onto the line P, P, in PZ. What part of the 
line do you get by taking s = 1 and letting ¢ vary? 

Suppose that K = R or C. If the curve F(x, y) = 0 in the xy-plane is smooth at 
P, =(x,, 1) with nonvertical tangent line, then we can expand the implicit 
function y = f(x) in a Taylor series about x =x,. The linear term gives 
the tangent line. If we subtract off the linear term, we obtain f(x) — y, — 
St (x) (% — x1) =a, (x — x1)" +--+, wherea,, # 0,m > 2. mis called the ‘order 
of tangency”’. We say that (x,, y,) isa point of inflection ifm > 2,1.e.,f”(x,) = 0. 
(In the case K = R, note that we are not requiring a change in concavity with 
this definition, e.g., y = x* has a point of inflection at x = 0.) Let P, = (x;, 
V1» 21), 2; #0, and let /= P, P, be tangent to the curve F(x, y) = F(x, y, 1) at 
the smooth point P,. Let P, = (x2, y2, Z2). Show that m is the lowest power of 
t that occurs in F(x, + (x2, 91 + ty2, 2, + t2,)EK[]. 

Show that m does not change if we make a linear change of variables in P32. 
For example, suppose that y, and z, are both nonzero, and we use the xz-plane 
instead of the xy-plane in parts (a) and (b). 


7. Show that the line at infinity (with equation z = 0) is tangent to the elliptic curve 
y? = f(x) at (0, 1, 0), and that the point (0, 1, 0) is a point of inflection on the curve. 
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§4. Doubly periodic functions 


Let L be a lattice in the complex plane, by which we mean the set of all 
integral linear combinations of two given complex numbers w, and 2, 
where w, and w, do not lie on the same line through the origin. For example, 
if w, =i and @, = 1, we get the lattice of Gaussian integers {mi + n|m, 
neZ}. It will turn out that the example of the lattice of Gaussian integers 
is intimately related to the elliptic curves y? = x* — n?x that come from the 
congruent number problem. 
The fundamental parallelogram for w,, @, is defined as 


Tl = {aw, + bw,|(0<a<1,0<b< ]}. 


Since w,, @, form a basis for C over R, any number xeC can be written in 
the form x = aw, + baw, for some a, bER. Then x can be written as the 
sum of an element in the lattice L = {mw, + nw,} and an element in IT, and 
in only one way unless a or b happens to be an integer, i.e., the element of 
TI happens to lie on the boundary éI]. 

We shall always take w,, w, in clockwise order; that is, we shall assume 
that w,/w, has positive imaginary part. 

Notice that the choice of w,, @, giving the lattice L is not unique. For 
example, w, = w, + @, and @, give the same lattice. More generally, we 
can obtain new bases w{, w3 for the lattice L by applying a matrix with 
integer entries and determinant | (see Problem | below). 

For a given lattice L, a meromorphic function on C is said to be an elliptic 
function relative to L if f(z + 1) = f(z) for all Je L. Notice that it suffices to 
check this property for / = m, and / = w,. In other words, an elliptic func- 
tion is periodic with two periods w, and @,. Such a function is determined 
by its values on the fundamental parallelogram IT; and its values on opposite 
points of the boundary of II are the same, ie., f(aw, + m2) = f(aw,), 
f(a, + b@,) = f(bm,). Thus, we can think of an elliptic function f(z) as a 
function on the set IT with opposite sides glued together. This set (more 
precisely, ‘““complex manifold”’) is known as a “‘torus”’. It looks like a donut. 

Doubly periodic functions on the complex numbers are directly analogous 
to singly periodic functions on the real numbers. A function f(x) on R which 
satisfies f(x + nw) = f(x) is determined by its values on the interval [0, «]. 
Its values at 0 and @ are the same, so it can be thought of as a function on 
the interval [0, w] with the endpoints glued together. The “real manifold” 
obtained by gluing the endpoints is simply a circle (see Fig. I.7). 

Returning now to elliptic functions for a lattice L, we let &, denote the 
set of such functions. We immediately see that &, is a subfield of the field 
of all meromorphic functions, i.e., the sum, difference, product, or quotient 
of two elliptic functions is elliptic. In addition, the subfield &, is closed under 
differentiation. We now prove a sequence of propositions giving some very 
special properties which any elliptic function must have. The condition that 
a meromorphic function be doubly periodic turns out to be much more 
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baal 


Figure 1.7 


restrictive than the analogous condition in the real case. The set of real- 
analytic periodic functions with given period is much “larger” than the set 
é, of elliptic functions for a given period lattice L. 


Proposition 3. A function f(z)€&,, L = {mw, + nw}, which has no pole in 
the fundamental parallelogram TI must be a constant. 


Proor. Since I] is compact, any such function must be bounded on I, say 
by a constant M. But then | f(z)| < M for all z, since the values of f(z) are 
determined by the values on II. By Liouville’s theorem, a meromorphic 
function which is bounded on all of C must be a constant. oO 


Proposition 4. With the same notation as above, let « + TI denote the translate 
of II by the complex number a, i.e., {x + z|ze TI}. Suppose that f(z) €&,, has 
no poles on the boundary C of « + 1. Then the sum of the residues of f(z) in 
a + II is zero. 


Proor. By the residue theorem, this sum is equal to 


1 
Oni [vow 


But the integral over opposite sides cancel, since the values of f(z) at corre- 
sponding points are the same, while dz has opposite signs, because the path 
of integration is in opposite directions on opposite sides (see Fig. 1.8). Thus, 
the integral is zero, and so the sum of residues is zero. oO 


Notice that, since a meromorphic function can only have finitely many 
poles in a bounded region, it is always possible to choose an « such that the 
boundary of « + II misses the poles of f(z). Also note that Proposition 4 
immediately implies that a nonconstant f(z)¢&, must have at least two 
poles (or a multiple pole), since if it had a single simple pole, then the sum 
of residues would not be zero. 
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Figure 1.8 


Proposition 5. Under the conditions of Proposition 4, suppose that f(z) has no 
zeros or poles on the boundary of « + M1. Let {m,} be the orders of the various 
zeros ina + II, and let {n,} be the orders of the various poles. Then Xm, = X nj. 


Proor. Apply Proposition 4 to the elliptic function f’(z)/f(z). Recall that the 
logarithmic derivative f’(z)/f(z) has a pole precisely where f(z) has a zero 
or pole, such a pole is simple, and the residue there is equal to the order of 
zero or pole of the original f(z) (negative if a pole). (Recall the argument: If 
S(Z) = Cp (Z — a)" + +++, then f’(z) =c,,m(z — ay" 1 +--+, and so f’(z)/f(z) 
=m(z—a)'+.---.) Thus, the sum of the residues of f’(z)/f(z) is 2m; — 
in; = 0. Oo 


We now define what will turn out to be a key example of an elliptic 
function relative to the lattice L = {mw, + nw,}. This function is called the 
Weierstrass g-function. It is denoted g(z; L) or ~(Z; @,, @2), or simply 
yo(z) if the lattice is fixed throughout the discussion. We set 


1 1 1 

az oy ee eae aceea pence Ye 4.1 

e@)= e&; Fe * Ge — 1) i) a 
140 

Proposition 6. The sum in (4.1) converges absolutely and uniformly for z in 

any compact subset of C — L. 


Proor. The sum in question is taken over a two-dimensional lattice. The 
proof of convergence will be rather routine if we keep in mind a one- 
dimensional analog. If instead of L we take the integers Z, and instead of 
reciprocal squares we take reciprocals, we obtain a real function f(x) = 
442-1, +4, where the sum is over nonzero /€Z. To prove absolute and 
uniform convergence in any compact subset of R — Z, first write the sum- 
mand as x/(/(x — /)), and then use a comparison test, showing that the series 
in question basically has the same behavior as /~?. More precisely, use the 
following lemma: if = 5, is a convergent sum of positive terms (all our sums 
being over nonzero /€Z), and if f(x) has the property that | f(x)/b,| 
approaches a finite limit as / > +00, uniformly for x in some set, then the 
sum = f,(x) converges absolutely and uniformly for x in that set. The details 


§4. Doubly periodic functions 17 


are easy to fill in. (By the way, our particular example of f(x) can be shown 
to be the function 2 cot nx; just take the logarithmic derivative of both 
sides of the infinite product for the sine function: sin 2x = axII?_,(1 — 
2 5,2 
x*/n*).) 
The proof of Proposition 6 proceeds in the same way. First write the 
summand over a common denominator: 


1 1 _ az 
Z@—-l)? FP Z—D 


Then show absolute and uniform convergence by comparison with |/|~?, 
where the sum is taken over all nonzero /e L. More precisely, Proposition 6 
will follow from the following two lemmas. 


Lemma 1. [f 2 5, is a convergent sum of positive terms, where the sum is taken 
over all nonzero elements in the lattice L, and if X f\(z) has the property that 
| f:(z)/by| approaches a finite limit as |l| > 00, uniformly for z in some subset 
of C, then the sum = f,(z) converges absolutely and uniformly for z in that set. 


Lemma 2. 2 |/|~* converges if s > 2. 


The proof of Lemma | is routine, and will be omitted. We give a sketch 
of the proof of Lemma 2. We split the sum into sums over / satisfying 
n—1<|/|<n,asn=1,2, .... It is not hard to show that the number of / 
in that annulus has order of magnitude n. Thus, the sum in the lemma is 
bounded by a constant times £%,n-n-*=Zn'~5, and the latter sum 
converges for s—1> 1. 

This concludes the proof of Proposition 6. Oo 


Proposition 7. ¢9(z)€&,, and its only pole is a double pole at each lattice point. 


Proor. The same argument as in the proof of Proposition 6 shows that for 
any fixed /eL, the function g(z) — (z — /)~? is continuous at z = /. Thus, 
go(z) is a meromorphic function with a double pole at all lattice points and 
no other poles. Next, note that g(z) = ”(—z), because the right side of 
(4.1) remains unchanged if z is replaced by —z and / is replaced by —/; but 
summing over /e L is the same as summing over —/e L. 

To prove double periodicity, we look at the derivative. Differentiating 
(4.1) term-by-term, we obtain: 


1 
g'(z) = —2 ) ——... 
=~? 2 Gap 
Now g’(z) is obviously doubly periodic, since replacing z by z+ /, for 
some fixed /)¢Z merely rearranges the terms in the sum. Thus, g’(z)€é,. 
To prove that g(z)e&,, it suffices to show that o(z + w,) — e(z) = 0 for 
i= 1, 2. We prove this for i = 1; the identical argument applies to i = 2. 
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Since the derivative of the function g(z + ,) — @(2) is ~'(z + @,) — 
yo'(z) = 0, we must have g(z + w,) — v(z) = C for some constant C. But 
substituting z = —4@, and using the fact that g(z) is an even function, we 
conclude that C = g4a,) — ~(—4$a,) = 0. This concludes the proof. a 


Notice that the double periodicity of g(z) was not immediately obvious 
from the definition (4.1). 

Since 9(z) has exactly one double pole in a fundamental domain of the 
form « + I, by Proposition 5 it has exactly two zeros there (or one double 
zero). The same is true of any elliptic function of the form g(z) — u, where 
u is a constant. It is not hard to show (see the problems below) that go(z) 
takes every value ue C U {00} exactly twice on the torus (i.e., a fundamental 
parallelogram with opposite sides glued together), counting multiplicity 
(which means the order of zero of g(z)—u); and that the values as- 
sumed with multiplicity two are 00, e; = 9(@,/2), e. = (2/2), and e3 = 
((@, + @)/2). Namely, g(z) has a double pole at 0, while the other three 
points are the zeros of ’(z). 


§5. The field of elliptic functions 


Proposition 7 gives us a concrete example of an elliptic function. Just as 
sin x and cos x play a basic role in the theory of periodic functions on R, 
because of Fourier expansion, similarly the functions ¢9(z) and 49’(z) play a 
fundamental role in the study of elliptic functions. But unlike in the real 
case, we do not even need infinite series to express an arbitrary elliptic 
function in terms of these two basic ones. 


Proposition 8. &, = C(, @’), i.e., any elliptic function for L is a rational 
expression in §o(z;L) and g'(z; L). More precisely, given f(z)€&,, there 
exist two rational functions g(X), h(X) such that f(z) = g(e(2))+ 
9'(z)h(@(Z)). 
Proor. If f(z) is an elliptic function for L, then so are the two even functions 
{@ +f(—2) fQ)-f(=-2) 
5 and Pare ie 


Since f(z) is equal to the first of these functions plus ga’ (z) times the second, 
to prove Proposition 8 it suffices to prove 


Proposition 9. The subfield &{ < &, of even elliptic functions for L is generated 
by (2), i.e., ép ae C(g). 


Proor. The idea of the proof is to cook up a function which has the same 
zeros and poles as f(z) using only functions of the form g(z) — u with ua 
constant. 
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The ratio of f(z) to such a function is an elliptic function with no poles, and 
so must be a constant, by Proposition 3. 

Let f(z) €&] . We first list the zeros and poles of f(z). But we must do this 
carefully, in a special way. Let I’ be a fundamental parallelogram with two 
sides removed: I’ = {aw, + bw,|0 <a <1,0<6 <1}. Then every point 
in C differs by a lattice element from exactly one point in IT’; that is, I’ is 
a set of coset representatives for the additive group of complex numbers 
modulo the subgroup L. We will list zeros and poles in I’, omitting 0 from 
our list (even if it happens to be a zero or pole of f(z)). Each zero or pole 
will be listed as many times as its multiplicity. However, only “half” will 
be listed; that is, they will be arranged in pairs, with only one taken from 
each pair. We now give the details. We describe the method of listing zeros; 
the method of listing poles is exactly analogous. 

First suppose that aeII’, a #0, is a zero of f(z) which is not half of a 
lattice point, i.e., a # @,/2, w/2, or (w, + w2)/2. Let a* eI’ be the point 
“symmetric” to a, i.e., a* = w, + w, — a if ais in the interior of IT’, while 
a* = @, — aor a* = w, — aif ais on one of the two sides (see Fig. 1.9). If a 
is a zero of order m, we claim that the symmetric point a* is also a zero of 
order m. This follows from the double periodicity and the evenness of f(z). 
Namely, we have f(a* — z) = f(—a —z) by double periodicity, and this is 
equal to f(a + z) because f(z) is an even function. Thus, if f(a + z) = a,2" + 
higher terms, it follows that f(a* + z) = a,,(—z)™ + higher terms, i.e., a* is 
a zero of order m. 

Now suppose that aeIl’ is a zero of f(z) which is half of a lattice point; 
for example, suppose that a = w,/2. In this case we claim that the order of 
zero m is even. If f(a+z)=f(@, +z)=4,2"+ higher terms, then 
SG@, — 2) =f(—4@, + 2) = fGoa, + 2) by double periodicity and evenness. 
Thus, a,,z” + higher terms = a,,(—z)" + higher terms, and so m is even. 

We are now ready to list the zeros and poles of f(z). Let {a,} be a list of 
the zeros of f(z) in Tl’ which are not half-lattice points, each taken as many 
times as the multiplicity of zero there, but only one taken from each pair of 
symmetrical zeros a, a*; in addition, if one of the three nonzero half-lattice 
points in IT’ is a zero of f(z), include it in the list half as many times as its 
multiplicity. Let {5,} be a list of the nonzero poles of f(z) in II’, counted in 
the same way as the zeros (i.e., “‘only half” of them appear). 
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Since all of the a; and b; are nonzero, the values o(a,) and o(b,) are finite, 
and it makes sense to define the elliptic function 


Ni(@(Z) — g(a) 

T1(@(z) — @(6)))” 

We claim that g(z) has the same zeros and poles as f(z) (counting multiplic- 
ity), from which it will follow that f(z) = c-g(z) for some constant c. Since 
g(z) is a rational function of g(z), this will complete the proof. 

To prove this claim, we first examine nonzero points in IT’. Since 0 is the 
only pole in the numerator or denominator of g(z), it follows that the 
nonzero zeros of g(z) must come from the zeros of fo(z) — g(a;), while the 
nonzero poles of g(z) must come from the zeros of (z) — (6;). But we 
know (see problems below) that s9(z) — u (for constant u) has a double zero 
at z=u if wis a half-lattice point, and otherwise has a pair of simple zeros 
at u and the symmetric point u*. These are the only zeros of g(z) — uin IT’. 
By our construction of the a; and b,, we see that g(z) and f(z) have the same 
order of zero or pole everywhere in IT’, with the possible exception of the 
point 0. So it merely remains to show that they have the same order of zero 
or pole at 0. But this will follow automatically by Proposition 5. Namely, 
choose « so that no lattice point and no zero or pole of f(z) or g(z) is on the 
boundary of « + Tl. Then « + II will contain precisely one lattice point /. 
We know that f(z) and g(z) have the same orders of zeros and poles every- 
where in a + IT with the possible exception of /. Let m, denote the order of 
zero of f(z) at / (m, is negative if there is a pole), and let m, denote the anal- 
ogous order for g(z). Then 


gz) = 


m, + (total of orders of zeros of f) — (total of orders of poles of f) 
=m, + (total of orders of zeros of g) — (total of orders of poles of g). 


Since the corresponding terms in parentheses on both sides of the equality 
are equal, we conclude that m, = m,. Thus, Proposition 5 tells us that when 
we know that two elliptic functions have the same order of zero or pole 
everywhere but possibly at one point in the fundamental parallelogram, then 
that one point is carried along automatically. This concludes the proof of 
Proposition 9. Oo 


The proof of Propositions 8 and 9 was constructive, i.e., it gives us a 
prescription for expressing a given elliptic function in terms of 49(z) once 
we know its zeros and poles. Without doing any more work, for example, 
we can immediately conclude that: 


(1) the even elliptic function ”’(z)? is a cubic polynomial in g(z) (because 
g’(z) has a triple pole at 0 and three simple zeros, hence there are three 
a;s and no b,’s); 

(2) the even elliptic function g(Nz) (for any fixed positive integer N) is a 
rational function in g(z). 
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Both of these facts will play a fundamental role in what follows. The first 
tells us that the Weierstrass g-function satisfies a differential equation of a 
very special type. This equation will give the connection with elliptic curves. 
The second fact is the starting point for studying points of finite order on 
elliptic curves. Both facts will be given a more precise form, and the connec- 
tion with elliptic curves will be developed, in the sections that follow. 


PROBLEMS 


1. Prove that the lattice L = {mw, + nw,} and the lattice L’ = {mw + nw} are the 
same if and only if there is a 2 x 2 matrix A with integer entries and determinant 
+1 such that w’ = Aw (where w denotes the column vector with entries w,, @2). 
If the pairs w,, w2 and w}, w%, are each listed in clockwise order, show that det A = 
+1. 


2. Let C/L denote the quotient of the additive group of complex numbers by the 
subgroup L = {mw, + nw,}. Then C/L is in one-to-one correspondence with the 
fundamental parallelogram II with opposite sides glued together. 

(a) Let C be the circle group (the unit circle in the complex plane). Give a continuous 
group isomorphism from C/Z to the product of C with itself. 

(b) How many points of order N or a divisor of N are there in the group C/L? 

(c) Show that the set of subgroups of prime order p in C/L is in one-to-one corre- 
spondence with the points of PE (where F, = Z/pZ). How many are there? 


3. Let s = 2, 3,4, .... Fix a positive integer N, and let f: Z x Z >C be any function 
of period N, i.e., f(m + N,n) = f(m, n) and f(m, n+ N) = fim, n). Suppose that 
S(O, 0) = 0. If s =2, further suppose that = f(m, n) = 0, where the sum is over 
0<m,n<N. Define a function 


f(m, 2) 
AO y (2) ~ 2s (ma, + na>)° 
(a) Prove that this sum converges absolutely if s > 2 and conditionally if s = 2 
(in the latter case, take the sum over m and n in nondecreasing order of |mw, + 
no,|). 
(b) Express F,(@,, w) in terms of the values of 9(z; w,, @2) or a suitable derivative 
evaluated at values of zeTI for which Nze L (see Problem 2(b)). 


4. Show that for any fixed u, the elliptic function ¢9(z) — u has exactly two zeros (or a 
single double zero). Use the fact that g'(z) is odd to show that the zeros of ¢’(z) 
are precisely w,/2, w,/2, and (w, + w )/2, and that the values e, = go(@,/2), e, = 
(2/2), e3 = —((W, + w2)/2) are the values of u for which g(z) — u has a double 
zero. Why do you know that e;, @,, e3 are distinct? Thus, the Weierstrass y-function 
gives a two-to-one map from the torus (the fundamental parallelogram I with 
opposite sides glued together) to the Riemann sphere C vu {00} except over the four 
“branch points” e,, €,, @3, 00, each of which has a single preimage in C/L. 


5. Using the proof of Proposition 9, without doing any computations, what can you 
say about how the second derivative 4” (z) can be expressed in terms of go(z)? 
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As remarked at the end of the last section, from the proof of Proposition 9 
we can immediately conclude that the square of g9’(z) is equal to a cubic 
polynomial in g(z). More precisely, we know that yo’(z)? has a double zero 
at w,/2, w,/2, and (wm, + @,)/2 (see Problem 4 of §5). Hence, these three 
numbers are the a,’s, and we have 


9° (2) = C(p(z) — (4/2))(@(Z) — E(2/2))(@@) — E((@: + 2)/2)) 
= C(p(Z) — €1)(@(Z) — e2)(@@) — es), 


where C is some constant. It is easy to find C by comparing the coefficients 
of the lowest power of z in the Laurent expansion at the origin. Recall that 
(z) — z-* is continuous at the origin, as is ¢0’(z) + 2z~>. Thus, the leading 
term on the left is (— 2z~3)? = 4z~°, while on the right it is C(z~*)? = Cz~°. 
We conclude that C = 4. That is, g(z) satisfies the differential equation 


@'(z)’ =f(p(z)), where f(x) = 4(x — e,)(x — e2)(x — e3)€ CL x]. 
(6.1) 


Notice that the cubic polynomial fhas distinct roots (see Problem 4 of §5). 

We now give another independent derivation of the differential equation 
for g9(z) which uses only Proposition 3 from §4. Suppose that we can find a 
cubic polynomial f(x) = ax? + bx? + cx + dsuch that the Laurent expansion 
at 0 of the elliptic function /(%(z)) agrees with the Laurent expansion of 
go’(z)* through the negative powers of z. Then the difference go’(z)? — f(g(z)) 
would be an elliptic function with no pole at zero, or in fact anywhere else 
(since ga(z) and g’(z) have a pole only at zero). By Proposition 3, this differ- 
ence is a constant; and if we suitably choose d, the constant term in f(x), 
we can make this constant zero. 

To carry out this plan, we must expand g(z) and g’(z)* near the origin. 
Since both are even functions, only even powers of z will appear. 

Let c be the minimum absolute value of nonzero lattice points /. We shall 
take r < 1, and assume that z is in the disc of radius rc about the origin. 
For each nonzero /eL, we expand the term corresponding to / in the 
definition (4.1) of g~(z). We do this by differentiating the geometric series 
1d —x)=1+x+x?+.--- and then substituting z// for x: 


1 
par 4 42 
Ga f+ 35 + ate 


If we now subtract 1 from both sides, divide both sides by /*, and then 
substitute in (4. ‘ we obtain 


e(z) = a Hae t -+( 


120 
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We claim that this double series is absolutely convergent for |z| < rc, 
in which case the following reversal of the order of summation will be 
justified : 


1 


(2) = = t+ 3G42* + SGo2* + TGy2° + ---, (62) 


where for k > 2 we denote 


1 
G.=G(L)= —- VY [k= ee 6.3 
k x(L) Gl, 2) ae (mo, +nw,) (6.3) 
1#0 not both 0 
(notice that the G, are zero for odd k, since the term for / cancels the term 
for —/; as we expect, only even powers of z occur in the expansion (6.2)). 
To check the claim of absolute convergence of the double series, we write 
the sum of the absolute values of the terms in the inner sum in the form 
(recall: |z| < r|/|): 
= 3). i 5 2|z| 1 
Qz-\4j-3- (lL tir ter? +or 4+...) << 
lal ( Voy ee ) (dn? |?” 
and then use Lemma 2 from the proof of Proposition 6. 
We now use (6.2) to compute the first few terms in the expansions of 


(2), e(z)”, @(z)°, e(z), and ~’(z)’, as follows: 


g'(zZ) = 3 + 6G,z + 20Ggz7 + 42G,z7 + --:; (6.4) 
e'(2? = 3 = G.5 ~ 80G; + (36G2 — 168G,)z2 +--+; 6.5) 
p@)? = 4 4 6G, +:10Gs2? + +++; (6.6) 
e@)? = a cf 96a 4+ 15G, + (21G + 27G)zZ24---. (6.7) 


Recall that we are interested in finding coefficients a, b, c, d of a cubic 
I(x) = ax? + bx? + cx + d such that 


9'(z)? = ag(z)? + be(z)* + ca(z) +4, 


and we saw that it suffices to show that both sides agree in their expansion 
through the constant term. If we multiply equation (6.7) by a, equation (6.6) 
by 4, equation (6.2) by c, and then add them all to the constant d, and finally 
equate the coefficients of z~°, z~*, z~* and the constant term to the corre- 
sponding coefficients in (6.5), we obtain successively : 


a=4; b=0; —24G, = 4(9G,) + ¢; —80G, = 4(15G,) + d. 
Thus, c = —60G,, d= —140G,. It is traditional to denote 
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92 = 92(L) =, 60G, = 60 )) I*; 
140 
93 = 93(L) = 140G, = 140 Y I-°. 


leL 
1#0 


(6.8) 


We have thereby derived a second form for the differential equation (6.1): 
g'(z) =f(e(z)), where f(x) = 4x* —g,x—g3eC[x]. (6.9) 


Notice that if we were to continue comparing coefficients of higher powers 
of zin the expansion of both sides of (6.9), we would obtain relations between 
the various G, (see Problems 4—5 below). 

The differential equation (6.9) has an elegant and basic geometric inter- 
pretation. Suppose that we take the function from the torus C/L (i.e., the 
fundamental parallelogram I with opposite sides glued) to Pé defined by 


zt>(g(z), e(z), 1) for z40; 


aI 
0140, 1, 0). my 


The image of any nonzero point z of C/L is a point in the xy-plane (with 
complex coordinates) whose x- and y-coordinates satisfy the relationship 
y? = f(x) because of (6.9). Here f(x)€C[x] is a cubic polynomial with 
distinct roots. Thus, every point z in C/L maps to a point on the elliptic 
curve y? = f(x) in P2. It is not hard to see that this map is a one-to-one 
correspondence between C/L and the elliptic curve (including its point at 
infinity). Namely, every x-value except for the roots of f(x) (and infinity) 
has precisely two z’s such that go(z)= x (see Problem 4 of §5). The 
y-coordinates y = 4’(z) coming from these two z’s are the two square roots 
of f(x) = f(g(z)). If, however, x happens to be a root of f(x), then there is 
only one z value such that g(z) = x, and the corresponding y-coordinate is 
y = e’(z) = 0, so that again we are getting the solutions to y? = f(x) for our 
given x. 

Moreover, the map from C/L to our elliptic curve in PZ is analytic, meaning 
that near any point of C/L it can be given by a triple of analytic functions. 
Near non-lattice points of C the map is given by z+>(g(z), 9’(z), 1); and 
near lattice points the map is given by z+>(g(z)/@’(z), 1, 1/@’(z)), which is 
a triple of analytic functions near L. 

We have proved the following proposition. 


Proposition 10. The map (6.10) is an analytic one-to-one correspondence 
between C/L and the elliptic curve y? = 4x> — g,(L)x — g3(L) in PZ. 


One might be interested in how the inverse map from the elliptic curve 
to C/L can be constructed. This can be done by taking path integrals of 
dx|y = (4x? — g.x —g3) "dx from a fixed starting point to a variable 
endpoint. The resulting integral depends on the path, but only changes by 
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Figure I.10 


a “period’’, i.e., a lattice element, if we change the path. We hence obtain 
a well-defined map to C/L. See the exercises below for more details. 

We conclude this section with a few words about an algebraic picture 
that is closely connected with the geometric setting of our elliptic curve. 
Recall from Proposition 8 that any elliptic function (meromorphic function 
on the torus C/L) is a rational expression in g@(z) and g’(z). Under our 
one-to-one correspondence in Proposition 10, such a function is carried over 
to a rational expression in x and y on the elliptic curve in the xy-plane 
(actually, in P2). Thus, the field C(x, y) of rational functions on the xy-plane, 
when we restrict its elements to the elliptic curve y? = f(x), and then “pull 
back” to the torus C/L by substituting x = g9(z), y = g’(z), give us precisely 
the elliptic functions &,. Since the restriction of y? is the same as the restric- 
tion of f(x), the field of functions obtained by restricting the rational func- 
tions in C(x, y) to the elliptic curve is the following quadratic extension of 
C(x): C(x) Ly]/(y? — (4° — g.x — g;)). Algebraically speaking, we form 
the quotient ring of C(x)[y] by the principal ideal corresponding to the 
equation y? = f(x). 

Geometrically, projection onto the x-coordinate gives us Fig. I.10. Two 
points on the elliptic curve map to one point on the projective line, except 
at four points (the point at infinity and the three points where y = 0), where 
the two “‘branches”’ are “pinched” together. 

In algebraic geometry, one lets the field F = C(x) correspond to the com- 
plex line P{, and the field K = C(x, y)/(y? — (4x? — g2x — g3)) correspond 
to the elliptic curve in P2. The rings A = C[x] and B = C[x, y]/(y? — f(x) 
are the “‘rings of integers” in these fields. The maximal ideals in A are of 
the form (x — a)A; they are in one-to-one correspondence with aeC. A 
maximal ideal in B is of the form (x — a)B + (y — b)B (where b is a square 
root of f(a)), and it corresponds to the point (a, b) on the elliptic curve. 


K>B>(x-aB+(y—b)B (b= v/a) 


(x —aB+ (y+ 5)B 
| 
| | 


| 
: 
FDAD(x—aj)A 
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The maximal ideal (x — a)A, when “‘lifted up” to the ring B, is no longer 
prime. That is, the ideal (x — a)B factors into the product of the two ideals: 


(x — a)B= ((x —a)B4+(y — d)B)(x —-@B+ (y+ 5)B). 


The maximal ideal corresponding to the point a on the x-line splits into 
two maximal ideals corresponding to two points on the elliptic curve. If it 
so happens that b = 0, i.e., a is a root of f(x), then both of the ideals are the 
same, i.e., (x — a)B is the square of the ideal ((x — a)B + yB). In that case 
we say that the ideal (x — a)A “ramifies” in B. This happens at values a 
of the x-coordinate which come from only one point (a, 0) on the elliptic 
curve. Thus, the above algebraic diagram of fields, rings and ideals is an 
exact mirror of the preceding geometric diagram. 

We shall not go further than these ad hoc comments, since we shall not 
be using algebra geometric techniques in which follows. For a systematic 
introduction to algebraic geometry, see the textbooks by Shafarevich, 
Mumford, or Hartshorne. 


PROBLEMS 


1. (a) Let L = Z[Z] be the lattice of Gaussian integers. Show that g,(L) = 0 but that 
g,(L) is a nonzero real number. 

(b) Let L = Z[w], where w = 4(—1 + i/3), be the lattice of integers in the qua- 
dratic imaginary field Q(/—3). Show that g,(Z)=0 but that g3(L) is a 
nonzero real number. 

(c) For any nonzero complex number c, let cL denote the lattice obtained by 
multiplying all lattice elements by c. Show that g,(cL) = c~*g,(L), and g3(cL) = 
e78g5(L). 

(d) Prove that any elliptic curve y? = 4x° — g,x — g3 with either g, or g, equal 
to zero, is of the form y? = 4x° — g,(L)x — g3(L) for some lattice L. It can 
be shown that any elliptic curve is of that form for some lattice L. See, for 
example, [Whittaker & Watson 1958, §21.73]; also, we shall prove this much 
later as a corollary in our treatment of modular forms. 


2. Recall that the discriminant of a polynomial f(x) = agx" + a,x") +--+ +a,= 
p(x — €1))X — ey): (x — e,) is ab Me; — e))’. It is nonzero if and only if 
the roots are distinct. Since it is a symmetric homogeneous polynomial of degree 
n(n — 1) in the e;s, it can be written as a polynomial in the elementary symmetric 
polynomials in the es, which are (—1)'a,/a9. Moreover, each monomial term 
II; (a;/a)"i has total “weight” m, + 2m, + --- +m, equal to n(n — 1). Applying 
this to f(x) = 4x3 — gx — g3, we see that the discriminant is equal to a polynomial 
in gy, g3 of weight six, i.e., it must be of the form ag} + Bg3. Find « and B by com- 
puting 47(e, — e,)?(e, — e3)*(e2 — €3)” directly in the case g = 4, g; = 0 and the 
case g, = 0,g3=4. 


3. Since the even elliptic function g”(z) has a quadruple pole at zero and no other 
pole, you know in advance that it is equal to a quadratic polynomial in g(z). 
Find this polynomial in two ways: (a) comparing coefficients of powers of z; 
(b) differentiating 99’? = 49° — g,@ — g3. Check that your answers agree. 


S6. 
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. Use either the equation for 9’? or the equation for s9” to prove that Gg = 3G?. 


. Prove by induction that all G,s can be expressed as polynomials in G, and G, 


with rational coefficients, i.e., G,¢ @[G,, G,]. We shall later derive this fact again 
when we study modular forms (of which the G, turn out to be examples). 


. Let @, = it be purely imaginary, and let , = x. Show that as t approaches infinity, 


G,(it, m) approaches 2x ~*¢(k), where {(s) is the Riemann zeta-function. Suppose 
we know that ¢(2) = 27/6, ¢(4) = n*/90, (6) = 2°/945. Use Problem 4 to find 
(8). Use Problem 5 to show that x *¢(k)€Q for all positive even integers k. 


. Find the limit of g, and g, for the lattice L = {mit + nm} asi ow. 
. Show that v = csc? z satisfies the differential equation v’? = 4v?(v — 1), and that 
the function 
v=csc?z—4 


satisfies the differential equation v’? = 4v* — 4v — 4. What is the discriminant 
of the polynomial on the right? Now start with the infinite product formula for 
sin(1z), replace z by z/z, and take the logarithmic derivative and then the derivative 
once again to obtain an infinite sum for csc? z. Then prove that 


lim 4o(z; it, m) = csc? z — 4. 
lode @) 


. The purpose of this problem is to review the function z = log v for v complex, 


in the process providing a “dry run” for the problems that follow. 

(a) For v in a simply connected region of the complex plane that does not include 
the origin, define a function z of v by: 

__ fd 
1 t ; 

where the path from | to v is chosen arbitrarily, except that the same choice 
is made for all points in the region. (In other words, fix any path from | to 
Vo, and then to go to other v’s use a path from v, to v that stays in the region.) 
Call this function z = log v. Show that if a different path is chosen, the function 
changes by a constant value in the “lattice” L = {2zim}-; and that any lattice 
element can be added to the function by a suitable change of path. (Z is actually 
only a lattice in the imaginary axis Ri, not a lattice in C.) 

(b) Express dz/dv and dv/dz in terms of v. 

(c) If the function v = e* is defined by the usual series, use part (b) to show that 
e* is the inverse function of z = log v. 

(d) Show that the map e* gives a one-to-one correspondence between C/L and 
C — {0}. Under this one-to-one correspondence, the additive group law in 
C/L becomes what group law in C — {0}? 


Let L bea fixed lattice, set g. = g2(L), 93 = 93(L), e(z) = v(z; L). Letu = f(z) bea 
non-constant function on a connected open region R < C which satisfies the differ- 
ential equation u’* = 4u> — g,u — g3. Prove that u = o(z + «) for some constant «. 


. Let L= (mw, + nw} be a fixed lattice, and set g, = g,(L), g; = 93(L), e(z) = 


g(z,L). Let R, be an unbounded simply connected open region in the complex 
plane which does not contain the roots e,, e,, e; of the cubic 4x7 — g,x — gy. 
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For we R,, define a function z = g(u) by 


z= g(u) \- z 
u V4 — 93-93. 


where a fixed branch of the square root is chosen as ¢ varies in R,. Note that the 
integral converges and is independent of the path in R, from u to 00, since R, is 
simply connected. The function z = g(u) can be analytically continued by letting 
R, be a simply connected region in C — {e,, e2, e3} which overlaps with R,. If 
ueéR,, then choose u,ER,ARz, and set z=g(u) =g(u,) + Je (40 — gt — 
g3) ‘dt. This definition clearly does not depend on our choice of u,€R, O R2 or 
our path from u to uv, in R,. Continuing in this way, we obtain an analytic function 
which is multivalued, because our sequence of regions R,, Rj, R3, ... can wind 
around é,, €5, OF e3. 

(a) Express (dz/du)? and (du/dz)? in terms of u. 

(b) Show that u = g(z). In particular, when we wind around e,, e2, or e; the 
value of z can only change by something in L. Thus, z = g(u) is well defined 
as an element in C/Z for ueC — {e,, e3, 3}. The function z= g(u) then 
extends by continuity to e,, e2, 3. 

(c) Let C, be the path in the complex u-plane from e, to oo that is traced by u = 
o(z) as z goes from w,/2 to 0 along the side of II (see Fig. I.11). Show that 
Jc, (40? — gt — g3) "dt = —a,/2 for a suitable branch of the square root. 

(d) Let C, be the path that goes from oo to e, along C,, winds once around e, 
and then returns along C, to 0. Take the same branch of the square root as 
in part (c), and show that fc, (41° — got — 93) "?dt = wo. 

(e) Describe how the function z = g(u) can be made to give all preimages of u 
under u = g(z). 


. (a) Prove that all of the roots e;, e2, ¢; of 4x* — gx — g; are real if and only 


if g, and g, are real and A = g3 — 2793 > 0. 
(b) Suppose that the conditions in part (a) are met, and we order the e; so that 
€, > €3 > e,. Show that we can choose the periods of L to be given by 


1 o[et dt 1 . dt 
-o@,=i and -w,= , 
2 -2V93 + g2t— 40° 2 e, V4t? — gat — 93 


where we take the positive branch of the square root, and integrate along the 
real axis. 

(c) With these assumptions about the location of the e; on the real axis, describe 
how to change the path of integration and the branch of the square root in 
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Problem 11 so as to get the other values of z for which u = g(z), namely 
+z+ moa, + na. 


13. Suppose that g, = 4n”, g, =0. Take e,, e, e; so that e, >e; >e,. What are 
€1, €2, €3 in this case? Show that w, = iw,, 1.e., the lattice L is the Gaussian integer 
lattice expanded by a factor of w,. Show that as z travels along the straight line 
from @,/2 to w,/2 + @, the point (x, y) = (9(z), 9’(z)) moves around the real 
points of the elliptic curve y? = 4(x? — n?x) between —n and 0; and as z travels 
along the straight line from 0 to w, the point (x, y) = (g@(z), @’(z)) travels through 
all the real points of this elliptic curve which are to the right of (n, 0). Think of 
the ‘‘open”’ appearance of the latter path to be an optical illusion: the two ends are 
really ‘“‘tied together” at the point at infinity (0, 1, 0). 


1 t"dt n 135 1 
14. (a) Show that ee forn=0,1,2,.... 
®) \ Jti—-t) at 222 ( ;) 
(b) Under the conditions of Problem 12, with e, > e; > e,,setd = ss — 2 €(0, 1). 
27 e 


Derive the formula: 


ia 1 dt 
ete, leit =O =o). 


(c) Derive the formula w, = n(e, — e,) "?F(A), where 


= {13.5 1\}? a" 
F(A) = patente : 
Cpe E a5 (" ;)| nl 
The function F(A) is called a “hypergeometric series”’. 


(d) Show that the hypergeometric series in part (c) satisfies the differential equa- 
tion: A(l — AF’(A) + 1 — 24) F(A) — GFA) = 0. 


§7. The addition law 


In the last section we showed how the Weierstrass (9-function gives a 
correspondence between the points of C/Z and the points on the elliptic 
curve y? = f(x) = 4x3 — g,(L)x — g3(L) in P2. We have an obvious addition 
law for points in C/L, obtained from ordinary addition of complex numbers 
by dividing by the additive subgroup JL, i.e., ordinary addition “‘modulo L”’. 
This is the two-dimensional analog of ‘‘addition modulo one”’ in the group 
R/Z. 

We can use the correspondence between C/Z and the elliptic curve to 
carry over the addition law to the points on the elliptic curve. That is, to 
add two points P,; = (x,, y,) and P, = (x, y2), by definition what we do is 
go back to the z-plane, find z, and z, such that P, = (g(z,), ’(z,)) and 
P, =(@(Z2), 9’(Z2)), and then set P, + P, = (99(z; + 22), 9’ (Zz, + Z2)). This 
is just a case of the general principle: whenever we have a one-to-one corre- 
spondence between elements of a commutative group and elements of some 
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at Ww, 


at Ww) 


Figure I.12 


other set, we can use this correspondence to define a commutative group 
law on that other set. 

But the remarkable thing about the addition law we obtain in this way is 
that (1) there is a simple geometric interpretation of “‘adding”’ the points on 
the elliptic curve, and (2) the coordinates of P, + P, can be expressed 
directly in terms of x,, x, ¥,, ¥2 by rather simple rational functions. The 
purpose of this section is to show how this is done. 

We first prove a general lemma about elliptic functions. 


Lemma. Let f(z)€&,. Let T] = {aw, + bw,|0 < a, b < 1} be a fundamental 
parallelogram for the lattice L, and choose « so that f(z) has no zeros or poles 
on the boundary of « + TI. Let {a;} be the zeros of f(z) in « + 1, each repeated 
as many times as its multiplicity, and let {bj} be the poles, each occurring as 
many times as its multiplicity. Then La; — Xbe L. 


Proor. Recall that the function /’(z)/f(z) has poles at the zeros and poles 
of f(z), and its expansion near a zero a of order m is m/(z — a) + --- (and 
near a pole b of order —m the expansion is —m/(z — 6) + ---). Then the 
function zf’(z)/f(z) has the same poles, but, writing z = a + (z — a), we see 
that the expansion starts out am/(z — a). We conclude that La; — 26; is 
the sum of the residues of zf’(z)/f(z) inside « + IT. Let C be the boundary 
of « + TI. By the residue theorem, 


£04 
Ea So 5a | 


We first take the integral over the pair of opposite sides from « to a+ @, 
and from « + w, toa + @, + @, (see Fig. 1.12). This part is equal to 


ahs at@, AG) z ata, +@, AC) 
mil) me* | Fe) 


el iO i £2) 
5 mall sre ) dz — { (Z + @,) 40) a) 


+ 
sa me? £@) 
~ O15 [ a 


f() 


§7. The addition law 31 


Now make the change of variables u = f(z), so that f’(z)dz/f(z) = du/u. Let 
C, be the closed path from f(a) to f(« + w,) = f(a) traced by u = f(z) as z 
goes from a to a + w,. Then 


ee ae = 1 du 
ai f(2) Qn ae : 


and this is some integer n, eee the number of times the closed path C, 
winds around the origin (counterclockwise). Thus, we obtain —w,n for this 
part of our original integral. In the same way, we find that the integral over 
the remaining two sides of C is equal to —w,m for some integer m. Thus, 
Za; — 2b; = —nw, — maw, € L, as desired. This proves the lemma. oO 


We are now ready to derive the geometrical procedure for adding two 
points on the elliptic curve y? = f(x) = 4x? — g,(L)x — g3(L). For z in C/L, 
let P, be the corresponding point P, = (g(z), g’(z), 1), Py = (0, 1, 0) on the 
elliptic curve. Suppose we want to add P,, = (%1,y1) to P,, =(xX2, yz) to 
obtain the sum P_,., = (x3, ¥3). We would like to know how to go from 
the two points to their sum directly, without tracing the points back to the 
z-plane. 

We first treat some special cases. The additive identity is, of course, the 
image of z = 0. Let 0 denote the point at infinity (0, 1, 0), i.e., the additive 
identity of our group of points. The addition is trivial if one of the points 
is 0, 1.e., if z; or z, is zero. Next, suppose that P,, and P,, have the same 
x-coordinate but are not the same point. This means that Xy=X1,V2= —Jyq. 
In this case z; = —z,, because only ‘“‘symmetric”’ values of z (values which 
are the negatives of each other modulo the lattice L) can have the same 
g-value. In this case, P,, + P,, = P) = 0, i.e., the two points are additive 
inverses of one another. Speaking geometrically, we say that two points of 
the curve which are on the same vertical line have sum 0. We further note 
that in the special situation of a point P,, = P,, on the x-axis, we have 
V2 = —y, = 0, and it is easy to check that we still have P+ P,, = 2P,, =0. 
We have proved : 


Proposition 11. The additive inverse of (x, y) is (x, —y). 


Sivsn two points P; = P,, = (x1, y,)and P, = P,, = (2, 2) on the elliptic 
curve y? = 4x? — g,x —g, ‘(neither the point at infinity 0), there is a line 
/= P,P, joining them. If P,; = P,, we take / to be the tangent line to the 
eliptic curve at P,. If / is a vertical line, then we saw that P, + P, =0. 
Suppose that / is not a vertical line, and we want to find P, + P, = P; = 
(x3, ¥3). Our basic claim is that —P, = (x3, —y3) is the third point of 
intersection of the elliptic curve with /. 

Write the equation of / = P, P, in the form y = mx + b. A point (x, y) on 
/ is on the elliptic curve if and only if (mx + b)? = f(x) = 4x3 — g)x — g3, 
that is, if and only if x is a root of the cubic f(x) — (mx + b)*. This cubic 
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has three roots, each of which gives a point of intersection. If x is a double 
root or triple root, then / intersects the curve with multiplicity two or three 
at the point (x, y) (see Problem 6 of §I.3). In any case, the total number of 
points of intersection (counting multiplicity) is three. 

Notice that vertical lines also intersect the curve in three points, including 
the point at infinity 0; and the line at infinity has a triple intersection at 0 
(see Problem 7 of §1.3). Thus, any line in P2 intersects the curve in three 
points. This is a special case of 


Bezout’s Theorem. Let F(x, y, z) and G(x, y, z) be homogeneous polynomials 
of degree m and n, respectively, over an algebraically closed field K. Suppose 
that F and G have no common polynomial factor. Then the curves in Pz defined 
by F and G have mn points of intersection, counting multiplicities. 


For a more detailed discussion of multiplicity of intersection and a proof 
of Bezout’s theorem, see, for example, Walker’s book an algebraic curves 
[Walker 1978]. 

In our case F(x, y, 2) =y?2z — 4x3 + g)xz? +932? and G(x, y,z)= 
y — mx — bz. 


Proposition 12. [f P, + P, = P;, then —P; is the third point of intersection 
of |= P,P, with the elliptic curve. If P, = P,, then by P,P, we mean the 
tangent line at P,. 


Proor. We have already treated the case when P, or P, is the point at infinity 
0, and when P, = — P,. So suppose that / = P,P, has the form y = mx + b. 
Let P, = P,,, P, = P,,. To say that a point P, = (g(z), go'(z)) is on / means 
that 9’(z) = mg(z) + b. The elliptic function @’(z) — mg(z) — 5 has three 
poles and hence three zeros in C/L. Both z, and z, are zeros. According to 
the lemma proved above, the sum of the three zeros and three poles is equal 
to zero modulo the lattice L. But the three poles are all at zero (where g9(z) 
has a triple pole); thus, the third zero is —(z, + z,) modulo the lattice. 
Hence, the third point of intersection of / with the curve is P_(..4.,) = —P.,, 
as claimed. 

The argument in the last paragraph is rigorous only if the three points 
of intersection of / with the elliptic curve are distinct, in which case a zero 
of 9’(z) — mg(z) — b corresponds exactly to a point of intersection P,. 
Otherwise, we must show that a double or triple zero of the elliptic function 
always corresponds to a double or triple intersection, respectively, of / 
with the curve. That is, we must show that the two meanings of the term 
“multiplicity” agree: multiplicity of zero of the elliptic function of the 
variable z, and multiplicity of intersection in the xy-plane. 

Let z,, Z,, —Z3 be the three zeros of g’(z) — mg(z) — 5, listed as many 
times as their multiplicity. Note that none of these three points is the negative 
of another one, since / is not a vertical line. Since —z,;, —Z, Z3 are the three 


§7. The addition law 33 


Py 
Py 


Pi +P, 


Figure I.13 


zeros of g’(z) + mg(z) + b, it follows that +z,, +22, +23 are the six 
zeros of §9'(z)” — (mga(z) + b)” = f(@(z)) — (mez) + 6)? = 4(@) — x1) 
(9(z) — X2)(9(z) — x3), where x,, x, x3 are the roots of f(x) — (mx + 5)?. 
If, say, g0(z,) = x,, then the multiplicity of x, depends upon the number 
of +z,, +z3 which equal +z,. But this is precisely the number of z,, —z, 
which equal z,. Hence “multiplicity” has the same meaning in both cases. 
This concludes the proof of Proposition 12. Oo 


Proposition 12 gives us Fig. 1.13, which illustrates the group of real points 
on the elliptic curve y? = x* — x. To add two points P, and P,, we draw the 
line joining them, find the third point of intersection of that line with the 
curve, and then take the symmetric point on the other side of the x-axis. 

It would have been possible to define the group law in this geometrical 
manner in the first place, and prove directly that the axioms of an abelian 
group are satisfied. The hardest part would have been the associative law, 
which would have necessitated a deeper investigation of intersections of 
curves. In turns out that there is some flexibility in defining the group law. 
For example, any one of the eight points of inflection besides the point at 
infinity could equally well have been chosen as the identity. For details of 
this alternate approach, see [Walker 1978]. 

One disadvantage of our approach using g9(z) is that a priori it only applies 
to elliptic curves of the form y* = 4x? — g,(L)x — g3(L) or curves that can 
be transformed to that form by a linear change of variables. (Note that the 
geometrical description of the group law will still give an abelian group law 
after a linear change of variables.) In actual fact, as was mentioned earlier 
and will be proved later, any elliptic curve over the complex numbers can 
be transformed to the Weierstrass form for some lattice L. We already know 
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that our favorite example y? = x* — n*x corresponds to a multiple of the 
Gaussian integer lattice. In the exercises for this section and the next, we 
shall allow ourselves to use the fact that the group law works for any elliptic 
curve. 

It is not hard to translate this geometrical procedure into formulas 
expressing the coordinates (x3, y3) of the sum of P, =(x,,y,) and P, = 
(x2, y2) in terms of x,, X2, y,, yz and the coefficients of the equation of the 
elliptic curve. Although, strictly speaking, our derivation was for elliptic 
curves in the form y? = f(x) = 4x3 — g,(L)x — g3(L) for some lattice L, the 
procedure gives an abelian group law for any elliptic curve y? = f(x), as 
remarked above. So let us take f(x) = ax* + bx? + cx + deC[x] to be any 
cubic with distinct roots. 

In what follows, we shall assume that neither P, nor P, is the point at 
infinity 0, and that P; # —P,. Then the line through P, and P, (the tangent 
line at P, if P; = P,) can be written in the form y = mx + B, where m= 
(2 — V/(X2 — X1) if P, A P, and m = dy/dx|,,,.,, if P, = P). In the latter 
case we can express m in terms of x, and y, by implicitly differentiating 
y? = f(x); we find that m=/"(x,)/2y,. In both cases the y-intercept is 
B=), — mx, 

Then x,, the x-coordinate of the sum, is the third root of the cubic 
I(x) — (mx + B)*, two of whose roots are x,, x,. Since the sum of the three 
roots is equal to minus the coefficient of x? divided by the leading coefficient, 


we have: x, + x, +x; = —(b — m’)/a, and hence: 
= ee. BARS Z : : 
X3= ~X,— X2 24 3(B=% : i Ree Pas (7.1) 
b If f(x)\ : 
= ote! — | P. — p 2 
pay 2x, Pat aes if P, =P, (7.2) 


The y-coordinate y, is the negative of the value y = mx, + B, ie., 
Ya3=-)1 + mx, = x3), (7.3) 
where x; is given by (7.1) and (7.2), and 
mM=(¥2-—Vi%.—%1) if Py AP; 
m= f'(x1)/2y, if P, =P). 


(7.4) 


If our elliptic curve is in Weierstrass form y? = 4x? — g)x — g3, then 
we have a= 4, b=0, and f’(x,) = 12x? — g, in the addition formulas 
(7.1)-(7.4). 

In principle, we could have simply defined the group law by means of 
these formulas, and then verified algebraically that the axioms of a commu- 
tative group are satisfied. The hardest axiom to verify would be associativity. 
Tedious as this procedure would be, it would have one key advantage over 
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either the complex-analytic procedure (using g(z)) or the geometrical pro- 
cedure. Namely, we would never have to use the fact that our field K over 
which the elliptic curve is defined is the complex numbers, or even that it has 
characteristic zero. That is, we would find that our formulas, which make 
sense over any field K of characteristic not equal to 2, give an abelian group 
law. That is, if y? = f(x) = ax? + bx? + cx + de K[x] is the equation of 
an elliptic curve over K, and if we define f(x) = 3ax? + 2bx +c, then any 
two points having coordinates in some extension of K can be added using 
the formulas (7.1)—-(7.4). We shall make use of this fact in what follows, 
even though, strictly speaking, we have not gone through the tedious purely 
algebraic verification of the group laws. 


PROBLEMS 


1. Let L CR be the additive subgroup {mm} of multiples of a fixed nonzero real 
number w. Then the function z+>(cos(27z/m), sin(27z/w)) gives a one-to-one 
analytic map of R/L onto the curve x? + y? = 1 in the real xy-plane. Show that 
ordinary addition in R/L carries over to a rational (actually polynomial) law for 
“adding” two points (x,, y,) and (x2, y2) on the unit circle; that is, the coordinates 
of the “‘sum” are polynomials in x,, x2, y,, ¥2. Thus, the rational addition law on 
an elliptic curve can be thought of as a generalization of the formulas for the sine 
and cosine of the sum of two angles. 


2. (a) Simplify the expression for the x-coordinate of 2P in the case of the elliptic 
curve y? = x° — n?x. 

(b) Let X, Y, Z be a rational right triangle with area n. Let P be the corresponding 
point on the curve y? = x* — n?x constructed in the text in $1.2. Let Q be the 
point constructed in Problem 2 of §1.2. Show that P = 2Q. 

(c) Prove that, if P is a point not of order 2 with rational coordinates on the curve 
y? = x3 — nx, then the x-coordinate of 2P is the square of a rational number 
having even denominator. For example, the point Q = ((41/7)?, 720: 41/7°) 
on the curve y? = x? — 31?x is not equal to twice a point P having rational 
coordinates. (In this problem, recall: » is always squarefree.) 


3. Describe geometrically: (a) the four points of order two on an elliptic curve; (b) 
the nine points of order three; (c) how to find the twelve points of order four which 
are not of order two; (d) what the associative law of addition says about a certain 
configuration of lines joining points on the elliptic curve (draw a picture). 


4. (a) How many points of inflection are there on an elliptic curve besides the point 
at infinity? Notice that they occur in symmetric pairs. Find an equation for 
their x-coordinates. 

(b) In the case of the elliptic curve y? = x* — n?x find an explicit formula for 
these x-coordinates. Show that they are never rational (for any 7). 


5. Given a point Q on an elliptic curve, how many points P are there such that 2P = Q? 
Describe geometrically how to find them. 


6. Show that if K is any subfield of C containing g, and g3, then the points on the 
elliptic curve y? = 4x? — g,x — g, whose coordinates are in K form a subgroup 
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of the group of all points. More generally, show that this is true for the elliptic 
curve y? = f(x) if f(x)e Kx]. 


. Consider the subgroup of all points on y? = x? — n?x with real coordinates. How 


many points in this subgroup are of order 2? 3? 4? Describe geometrically where 
these points are located. 


. Same as Problem 7 for the elliptic curve y? = x° — a,aeR. 


. If y? = f(x) is an elliptic curve in which f(x) has real coefficients, show that the 


group of points with real coordinates is isomorphic to (a) R/Z if f(x) has only 
one real root; (b) R/Z x Z/2Z if f(x) has three real roots. 


. Letting a approach zero in Problem 8, show that for the curve y? = x? the same 


geometric procedure for finding P, + P, as for elliptic curves makes the smooth 
points of the curve (i.e., P # (0, 0), but including the point at infinity) into an 
abelian group. Show that the map which takes P = (x, y) to x/y (and takes the 
point at infinity to zero) gives an isomorphism with the additive group of complex 
numbers. This is called ‘additive degeneracy” of an elliptic curve. One way to 
think of this is to imagine both w, and w, approaching infinity (in different direc- 
tions). Then g, and g, both approach zero, so the equation of the corresponding 
elliptic curve approaches y? = 4x. Meanwhile, the additive group C/L, where 
L={ma, +nw,}, approaches the additive group C, i.e., the fundamental par- 
allelogram becomes all of C. 


. Let a> 0 in the elliptic curve y? = (x? — a)(x + 1). Show that for the curve y? = 


x?(x + 1) the same geometric procedure for finding P, + P, as for elliptic curves 
makes the smooth points of the curve into an abelian group. Show that the map 
which takes P = (x, y) to (y — x)/(y + x) (and takes the point at infinity to 1) 
gives an isomorphism with the multiplicative group C* of nonzero complex num- 
bers. This is called ‘“‘multiplicative degeneracy” of an elliptic curve. Draw the 
graph of the real points of y? = x?(x + 1), and show where the various sections 
go under the isomorphism with C*. One way to think of multiplicative degeneracy 
is to make the linear change of variables yr dy, x» —x — 4, so that the equation 
becomes y? = 4x? — $x — & (compare with Problem 8 of §I.6). So we are dealing 
with the limit as ¢ approaches infinity of the group C/{mit + nz}, ie., with the 
vertical strip C/{nz} (rather, a cylinder, since opposite sides are glued together), 
and this is isomorphic to C* under the map ze". 


§8. Points of finite order 


In any group, there is a basic distinction between elements of finite order 
and elements of infinite order. In an abelian group, the set of elements of 
finite order form a subgroup, called the “‘torsion subgroup”’. In the case of 
the group of points in P2 on the elliptic curve y? = f(x), we immediately see 
that a point P, = (x, y) has finite order if and only if NzeZL for some N, 
i.e., if and only if z is a rational linear combination of , and @,. In that 
case, the least such N (which is the least common denominator of the 
coefficients of wm, and w,) is the exact order of P,. Under the isomorphism 
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from R/Z x R/Z to the elliptic curve given by (a, b)H> Pro, tba,» it is the 
image of Q/Z x Q/Z which is the torsion subgroup of the elliptic curve. 

This situation is the two-dimensional analog of the circle group, whose 
torsion subgroup is precisely the group of all roots of unity, i.e., all e?"” 
for ze Q/Z. Just as the cyclotomic fields—the field extensions of @ generated 
by the roots of unity—are central to algebraic number theory, we would 
expect that the fields obtained by adjoining the coordinates of points 
P = (x, y) of order N on an elliptic curve should have interesting special 
properties. We shall soon see that these coordinates are algebraic (if the 
coefficients of f(x) are). This analogy between cyclotomic fields and fields 
formed from points of finite order on elliptic curves is actually much deeper 
than one might have guessed. In fact, a major area of research in algebraic 
number theory today consists in finding and proving analogs for such fields 
of the rich results one has for cyclotomic fields. 

Let N be a fixed positive integer. Let f(x) = ax? + bx? +cx+d= 
a(x — e€;)(* — e)(x — e3) be a cubic polynomial with coefficients in a field 
K of characteristic #2 and with distinct roots (perhaps in some extension 
of K). We are interested in describing the coordinates of the points of order 
N (i.e., exact order a divisor of NV) on the elliptic curve y? = f(x), where 
these coordinates may lie in an extension of K. If N = 2, the points of order 
N are the point at infinity 0 and (e;, 0), i= 1, 2, 3. Now suppose that N > 2. 
If N is odd, by a “nontrivial” point of order N we mean a point P 4 0 such 
that NP = 0. If N is even, by a “nontrivial’”’ point of order N we mean a 
point P such that NP = 0 but 2P 0. 


Proposition 13. Let K’ be any field extension of K (not necessarily algebraic), 
and let a: K’ > 0K’ be any field isomorphism which leaves fixed all elements 
of K. Let PEP. be a point of exact order N on the elliptic curve y? = f(x), 
where f(x)€K[x]. Then oP has exact order N (where for P = (x, y, z)€P2. 
we denote oP = (ox, ay, oz) € P2x,). 


PRooF. It follows from the addition formulas that oP, + oP, = o(P, + P,), 
and hence M(aP) = o(NP) = a0 = 0 (since o(0, 1, 0) = (0, 1, 0)). Hence oP 
has order N. It must have exact order N, since if N’oP =: 0, we would have 
o(N’P) = 0 = (0, 1, 0), and hence N’P = 0. This proves the proposition. O 


Proposition 14. In the situation of Proposition 13, with K a subfield of C, let 
Ky < C denote the field obtained by adjoining to K the x- and y-coordinates 
of all points of order N. Let Ky denote the field obtained by adjoining just 
their x-coordinates. Then both Ky and Kx are finite galois extensions of K. 


Proor. In each case Ky and Ky, we are adjoining a finite set of complex 
numbers which are permuted by any automorphism of C which fixes K. 
This immediately implies the proposition. Oo 


As an example, if N = 2, then K, = Kj is the splitting field of f(x) over K. 
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Recall that the group of points of order N on an elliptic curve in PZ is 
isomorphic to (Z/NZ) x (Z/NZ). Because any ¢ € Gal(Ky/K) respects addi- 
tion of points, i.e., o(P, + P,) = oP, + oP, it follows that each o gives an 
invertible linear map of (Z/NZ)* to itself. 

If R is any commutative ring, we let GL,(R) denote the group (under 
matrix multiplication) of all x n invertible matrices with entries in R. Here 
invertibility of a matrix A is equivalent to det Ae R*, where R* is the 
multiplicative group of invertible elements of the ring. For example: 


(1) GL,(R) = R*; 
(2) GL,(Z/NZ) = {C2 3 


a, b,c, deZ/NZ, ad — bce(Z/NZ)*}. 


It is easy to construct a natural one-to-one correspondence between invertible 
linear maps R" > R" and elements of GL,(R). There is no difference with 
the more familiar case when R is a field. 

In our situation of points of order N on an elliptic curve, we have seen 
that Gal(Ky/K) is isomorphic to a subgroup of the group of all invertible 
linear maps (Z/NZ)? > (Z/NZ)’. Thus, any o€ Gal(Ky/K) corresponds to 
a matrix (° 5)€GL,(Z/NZ). The matrix entries can be found by writing 


oP. in — Pgeo,[N+c00/N? oP.1N = Pre, |N+der,/N* 


Notice that this is a direct generalization of the situation with the 
N-th cyclotomic field Qy = OD. Recall that Gal(Q,/Q) x (Z/NZ)* = 
GL,(Z/NZ), with the element a which corresponds to o determined by 

o(e27/%) = e2ia(N 
But one difference in our two-dimensional case of division points on elliptic 
curves is that, in general, Gal(Ky/K) > GL,(Z/NZ) is only an injection, not 
an isomorphism. 

In the case K < C, say K = Q(gp, g3), where y? = f(x) = 4x* — gox — g3 
is in Weierstrass form, we shall now use the g-function to determine the 
polynomial whose roots are the x-coordinates of the points of order N. That 
is, Ky will be the splitting field of such a polynomial. 

We first construct an elliptic function fy(z) whose zeros are precisely the 
nonzero values of z such that P, is a point of order N. We follow the prescrip- 
tion in the proof of Proposition 9 of §1.5. If we¢C/L is a point of order N, 
then so is the symmetric point —u (which we denoted u* when we were 
thinking in terms of points in a fundamental parallelogram). We consider 
two cases: 


(i) N is odd. Then the points u and —u are always distinct modulo L. In 
other words, u cannot be w,/2, w,/2 or (wm, + w )/2 ifu has odd order N. 
We define 


Iv) = NT (9 — oe), (8.1) 
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where the product is taken over nonzero wE€C/L such that Nue L, with 
one u taken from each pair u, —u. Then fy(z) = Fy((z)), where Fy (x) € 
C[x] is a polynomial of degree (N? — 1)/2. The even elliptic function 
Jy(z) has N? — 1 simple zeros and a single pole at 0 of order N? — 1. 
Its leading term at z = 0 is N/z¥*>?. 

(ii) N is even. Now let u range over we C/L such that Nue L but u is not of 
order 2, 1.e.,u #0, w,/2, w/2, (w, + @,)/2. Define fy(z) by the product 
in (8.1). Then fy(z) = Fy(yv(z)), where Fy(x)€C[x] is a polynomial of 
degree (N* — 4)/2. The even elliptic function fy(z) has N? — 4 simple 
zeros and a single pole at 0 of order N? — 4. Its leading term at z = 0 is 
Niz¥?-4, 


If N is odd, the function f(z) has the property that 
Icy =N* [TT (@)— ew). 


O#ueC/L.NueL 
If N is even, then the function fy(z) & —4$'(z) fy (2) has the property that 
In(2)? = 49 (2)°folZ)? 
= N*(@(z) — e;)(@(z) — e2)(@(Z) — €3) I] (y(z) — p(u)) 
we C/L.Nue L. 2ug L 


= JN? I] (@®- 9). 


O#ueC/L.NueL 


We see that a point (x, y) = ((z), 9’(z)) has odd order N if and only if 
Fy(x) = 0. It has even order AN if and only if either y = 0 (ie., it is a point of 
order 2) or else Fy(x) = 0. 

Because of Propositions 13 and 14, we know that any automorphism of 
C fixing K = Q(g,, g3) permutes the roots of Fy. Hence, the coefficients of 
Fy are in K = Q(g3, g3). 

If we started with an elliptic curve not in Weierstrass form, say y? = 
I(x) = ax? + bx* + cx +d, and if we wanted to avoid using the g-function, 
then we could repeatedly apply the addition formulas (7.1)-(7.4) to compute 
the rational function of x and y which is the x-coordinate of NP, where 
P=(x,y). We would simplify algebraically as we go, making use of the 
relation y? = f(x), and would end up with an expression in the denominator 
which vanishes if and only if WP is the point at infinity, i.e., if and only if 
P has order N (recall: “order NV” means ‘‘exact order N or a divisor of N”’). 

What type of an expression would we have to get in the denominator of 
the x-coordinate of NP? Suppose, for example, that N is odd. Then this 
denominator would be an expression in K[x, )] (with y occurring at most to 
the first power), where K = Q(a, b,c, d), which vanishes if and only if x is 
one of the (NV? — 1)/2 values of x-coordinates of nontrivial points of order N. 
Thus, the expression must be a polynomial in x alone with (NV — 1)/2 roots. 
Similarly, we find that when N is even, this denominator has the form 
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y+(polynomial in x alone), where the polynomial in K[x] has (N* — 4)/2 
roots. 

It is important to note that the algebraic procedure described in the last 
two paragraphs applies for any elliptic curve y* = f(x) over any field K of 
characteristic # 2, not only over subfields of the complex numbers. Thus, 
for any K we end up with an expression in the denominator of the x- 
coordinate of NP that vanishes for at most N? — 1 values of (x, y). 

For a general field K, however, we do not necessarily get exactly N? — 1 
nontrivial points of order N. Of course, if K is not algebraically closed, the 
coordinates of points of order N may lie only in some extension of K. 
Moreover, if K has characteristic p, then there might be fewer points of order 
N for another reason: the leading coefficient of the expression in the denom- 
inator vanishes modulo p, and so the degree of that polynomial drops. We 
shall soon see examples where there are fewer than N* points of order N 
even if we allow coordinates in K*'£*). 

This discussion has led to the following proposition. 


Proposition 15. Let y? = f(x) be an elliptic curve over any field K of characteris- 
tic not equal to 2. Then there are at most N? points of order N over any exten- 
sion K’ of K. 


Now let us turn our attention briefly to the case of K a finite field, in 
order to illustrate one application of Proposition 15. We shall later return 
to elliptic curves over finite fields in more detail. 

Since there are only finitely many points in PF (namely, q?7+ 4+ 1), 
there are certainly only finitely many F,-points on an elliptic curve y? = f(x), 
where f(x)€F,[x]. So the group of F,-points is a finite abelian group. 

By the “type” of a finite abelian group, we mean its expression as a 
product of cyclic groups of prime power order. We list the orders of all 
of the cyclic groups that appear in the form: 22, 272, 2”, ..., 3%, 33, 373, 
..., 575, 585, .... But Proposition 15 implies that only certain types can 
occur in the case of the group of F,-points on y? = f(x). Namely, for each 
prime / there are at most two /-th power components /*!, /*1, since otherwise 
we would have more than /” points of order /. And of course /”'**! must equal 
the power of / dividing the order of the group. 

As an example of how this works, let us consider the elliptic curve y* = 
x* — n?x over K =F, (the finite field of g = p’ elements), where we must 
assume that p does not divide 2n. In the case when gq =3 (mod 4), it is 
particularly easy to count the number of F,-points. 


Proposition 16. Let q = p!, p{2n. Suppose that q = 3 (mod 4). Then there are 
q + 1 F,-points on the elliptic curve y* = x? — n?x. 


ProoF. First, there are four points of order 2: the point at infinity, (0, 0), 
and (+n, 0). We now count all pairs (x, y) where x # 0,1, —n. We arrange 
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these q — 3 x’s in pairs {x, —x}. Since f(x) = x? — n*x is an odd function, 
and —1 is not a square in F, (here’s where we use the assumption that q = 3 
(mod 4)), it follows that exactly one of the two elements f(x) and f(— x) = 
—f(x) is a square in F,. (Recall: In the multiplicative group of a finite field, 
the squares are a subgroup of index 2, and so the product of two nonsquares 
is a square, while the product of a square and a nonsquare is a nonsquare.) 
Whichever of the pair x, —x gives a square, we obtain exactly two points 
(x, +./f(x)) or else (—x, +./f(—x)). Thus, the (¢ — 3)/2 pairs give us 
q — 3 points. Along with the four points of order two, we have q + 1 F,- 
points in all, as claimed. Oo 


Notice that when g = 3 (mod 4), the number of F,-points on the elliptic 
curve y? = x> — n?x does not depend on a. This is not true if ¢g = 1 (mod 4). 

As an example, Proposition 16 tells us that for g = 7° there are 344 = 
23 - 43 points. Since there are four points of order two, the type of the group 
of F43-points on y? = x? — n?x must be (2, 27, 43). 

As a more interesting example, let g = p = 107. Then there are 108 = 
27-33 points. The group is either of type (2, 2, 3°) or of type (2, 2, 3, 37). 
To resolve the question, we must determine whether there are 3 or 9 points 
of order three. (There must be nontrivial points of order 3, since 3 divides 
the order of the group.) Recall the equation for the x-coordinates of points 
of order three (see Problem 4 of §7): —3x*+ 6n*x* +n*=0, ie, x= 
tnv1l+ 2,/3/3. Then the corresponding y-coordinates are found by taking 
+./f(x). We want to know how many of these points have both coordinates 
in F,97, rather than an extension of F,,.,. We could compute explicitly, using 
J3 = +18 in F,97, so that x = +,/13, +,/—11, etc. But even before doing 
those computations, we can see that not all 9 points have coordinates in 
F107. This is because, if (x, y) is in Fy 97, then (—x, ./—1y) is another point 
of order three, and its coordinates are not in F, 97. Thus, there are only 3 
points of order three, and the type of the group is (2, 2, 3%). 

Notice that if K is any field of characteristic 3, then the group of K-points 
has no nontrivial point of order three, because —3x* + 6n*x? + n* =n* £0. 
This is an example of the ‘dropping degree”’ phenomenon mentioned above. 
It turns out that the same is true for any p = 3 (mod 4), namely, there are no 
points of order p over a field of characteristic p in that case. This is related 
to the fact that such p remain prime in the ring of Gaussian integers Z[i], 
a ring which is intimately related to our particular elliptic curve (see Problem 
13 of §6). But we will not go further into that now. 


PROBLEMS 


1. For the elliptic curve y? = 4x3 — g,x — g3, express 49(Nz) as a rational function 
of go(z) when N = 2. 


2. Let fy(z) be the elliptic functions defined above. Express f;(z) as a polynomial in 
(2). 
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3. Set f,(z) = 1. Prove that for N = 2, 3,4, ... we have: 
(Nz) = @(2) — fy-1 2) fyi Dliv(Z)- 


4. In the notation of Proposition 14, suppose that ¢ € Gal(Ky/K) fixes all x-coordinates 
of points of order N. That is, ol Ks = identity. Show that the image of 0 in GL,(Z/NZ) 
is +1. Conclude that Gal(Ky/Ky) = {+1} 1G, where G is the image of Gal(Ky/K) 
in GL,(Z/N Z). What is the analogous situation for cyclotomic fields? 


5. Let L = {m@, + no }, and let E be the elliptic curve y* = 4x3 — g,(L)x — g3(L). 
Notice that E does not change if we replace the basis {w,,  } of L by another 
basis {w}, w,}. However, the group isomorphism C/L = R/Z x R/Z changes, and 
so does the isomorphism from the points of order N on Eto Z/NZ x Z/NZ. For 
example, the point ((w}/N), 9’(@/N)), rather than (~(@,/N), 9’(w,/N)), corre- 
sponds to (1, 0)eZ/NZ x Z/NZ. What effect does the change of basis from a; to 
«; have on the image of Gal(Ky/K) in GL,(Z/N Z)? 


6. Show that the group GL,(Z/2Z) is isomorphic to S3, the group of permutations of 
{1, 2, 3}. For each of the following elliptic curves, describe the image in GL,(Z/2Z) 
of the galois group over @ of the field generated by the coordinates of the points 
of order 2. 

(a) y? =x? — nx (n not a perfect square) 


—n (n not a perfect cube) 


7. (a) How many elements are in GL,(Z/3Z)? 
(b) Describe the field extension K, of K = @ generated by the coordinates of all 
points of order 3 on the elliptic curve y? = x* — nx. 
(c) Find [K;: Q]. What subgroup of GL,(Z/3Z) is isomorphic to Gal(K,/Q)? 
(d) Give a simple example of an element in GL,(Z/3Z) that is not in the image of 
Gal(K;/Q); in other words, find a pair of elements 2, = (m,@, + 1, )/3, 
Z2= Glin: + n,@ )/3 which generate all (mw, + n@ )/3 but such that P,,, P,, 
cannot be obtained from P,,j3, P.,,,3 by applying an automorphism to the 
coordinates of the latter pair of points. 


8. In Problem 13 of §1.6, we saw that the lattice corresponding to the curve y* = 
x? — n*x is the lattice L of Gaussian integers expanded by a factor w,e R: L= 
{miw, + nw,} = w,Z[i]. 

(a) Show that the map z+ iz gives an analytic automorphism of the additive group 
C/L; and, more generally, for any Gaussian integer a + bieZ[i] we have a 
corresponding analytic endomorphism of C/L induced by z+ (a + bi)z. 

(b) Notice that if 5 = 0, this is the map z->z+2+ --- +z (a times) which gives 
o,: PteaP on the elliptic curve. By looking at the definition of g(z), 9’(z), 
show that the map z+» iz gives the automorphism ¢;: (x, vy) (—x, iy) on the 
elliptic curve. This is an example of what’s called “complex multiplication”. 
Show that ¢;0¢; = ¢_,, and in fact the map a+ bit @,,,; is an injection of 
the ring Z[i] into the ring of endomorphisms of the group of points on the 
elliptic curve. 

(c) If L is a lattice in C and if there exists a complex number « = a + bi, b #0, 
such that a ¢ L, show that « is a quadratic imaginary algebraic integer, and 
that L contains a sublattice of finite index of the form w,Z[«]. 
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9. Each of the following points has finite order N on the given elliptic curve. In each 
case, find its order. 
(a) P= (0, 4) on y? = 4x? + 16 
(b) P= (2, 8) ony? = 4x? + 16x 
(c) P=(2,3)ony?=x34+1 
(d) P= (3, 8) ony? = x? — 43x + 166 
(e) P= (3, 12) ony? = x3 — 14x? + 81x 
(f) P=(0, 0) ony? +y=x3 — x? 
(g) P=(1, 0) ony? +xy t+ y=x9 — x? — 3x43. 


§9. Points over finite fields, and the congruent 
number problem 


We have mainly been interested in elliptic curves E over Q, particularly the 
elliptic curve y* = x — n*x, which we shall denote E,,. But if K is any field 
whose characteristic p does not divide 2, the same equation (where we 
consider n modulo p) is an elliptic curve over K. We shall let E,(K) denote 
the set of points on the curve with coordinates in K. Thus, Proposition 16 
in the last section can be stated: If q = 3 (mod 4), then #E,(F,) =q + 1. 

The elliptic curve E,, considered as being defined over F,,, is called the 
“reduction” modulo p, and we say that E, has “good reduction” if p does 
not divide 2n, i.e., if y? = x* — n’x gives an elliptic curve over F,. More 
generally, if y* = f(x) is an elliptic curve E defined over an algebraic number 
field, and if p is a prime ideal of the number field which does not divide the 
denominators of the coefficients of f(x) or the discriminant of f(x), then by 
reduction modulo p we obtain an elliptic curve defined over the (finite) 
residue field of p. 

At first glance, it may seem that the elliptic curves over finite fields— 
which lead only to finite abelian groups—are not a serious business, and 
that reduction modulo p is a frivolous game that will not help us in our 
original objective of studying Q-points on y? = x? — nx. However, this is 
far from the case. Often information from the various reductions modulo p 
can be pieced together to yield information about the Q-points. This is 
usually a subtle and difficult procedure, replete with conjectures and unsolved 
problems. However, there is one result of this type which is simple enough 
to give right now. Namely, we shall use reduction modulo p for various 
primes p to determine the torsion subgroup of E,(Q), the group of Q-points 
on y? =x> —n?x. 

In any abelian group, the elements of finite order form a subgroup, 
called the “torsion subgroup”. For example, the group E(C) of complex 
points on an elliptic curve is isomorphic to C/L, which for any lattice L is 
isomorphic to R/Z x R/Z (see Problem 2 of §1.5). Its torsion subgroup 
corresponds to the subgroup Q/Z x Q/Z < R/Z x R/Z, i., in C/L it 
consists of all rational linear combinations of w, and @,. 

A basic theorem of Mordell states that the group E(Q) of Q-points on an 
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elliptic curve E defined over Q is a finitely generated abelian group. This 
means that (1) the torsion subgroup £(Q),,,, is finite, and (2) E(Q) is iso- 
morphic to the direct sum of £(Q),,,, and a finite number of copies of 
Z: E(Q) = E(Q),,,, ® Z’. The nonnegative integer r is called the “rank” of 
E(Q). It is greater than zero if and only if E has infinitely many Q-points. 
Mordell’s theorem is also true, by the way, if Q is replaced by any algebraic 
number field. This generalization, proved by André Weil, is known as the 
Mordell—Weil theorem. We shall not need this theorem for our purposes, 
even in the form proved by Mordell. For a proof, the reader is referred to 
[HusemGller 1987] or [Lang 1978b]. 

We shall now prove that the only rational points of finite order on E£, are 
the four points of order 2: 0 (the point at infinity), (0, 0), (+n, 0). 


Proposition 17. # E,(Q),o.. = 4 


Proor. The idea of the proof is to construct a group homomorphism from 
E,(Q)tor5 to E,(F,,) which is injective for most p. That will imply that the 
order of £,(Q),,,, divides the order of £,(F,) for such p. But no number 
greater than 4 could divide all such numbers # E,(F,), because we at least 
know that #£,(F,) runs through all integers of the form p + | for pa prime 
congruent to 3 modulo 4 (see Proposition 16). 

We begin the proof of Proposition 17 by constructing the homomorphism 
from the group of Q-points on E, to the group of F,-points. More generally, 
we simply construct a map from P@ to P; In what follows, we shall always 
choose a triple (x, y, z) for a point in Pg in such a way that x, y, and z are 
integers with no common factor. Up to multiplication by +1, there is a 
unique such triple in the equivalence class. For any fixed prime p, we define 
the image P of P = (x, y, z)€ PZ to be the point P = (x, J, Ze PE, where 
the bar denotes reduction of an integer modulo p. Note that P is not the 
identically zero triple, because p does not divide all three integers x, y, 2 
Also note that we could have replaced the triple (x, y, z) by any multiple 
by an integer prime to p without affecting P. 

It is easy to see that if P = (x, y, z) happens to be in E,(Q), i-e., if y?z = 
x? — n°xz?, then P is in E,(F,). Moreover, the image of P, + P under this 
map is P, + P,, because it makes no difference whether we use the addition 
formulas (7.1)—(7.4) to find the sum and then reduce mod p, or whether we 
first reduce mod p and then use the addition formulas. In other words, our 
map is a homomorphism from £,(Q) to £,(F,), for any prime p not dividing 
2n. 

We now determine when this map is not injective, i.e., when two points 
P, =(x,,);,2;) and P, = (x3, ¥2, 22) in PQ have the same image P, = P; 
in P7.. 

Lemma. P, = P, if and only if the cross-product of P, and P, (considered as 
vectors in R°) is divisible by p, i.e., if and only if p divides y, 2. — 724, X22, — 
X22, aNd X,V_— X)y. 
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PROOF OF LEMMA. First suppose that p divides the cross-product. We consider 
two cases: 


(i) p divides x,. Then p divides x,z, and x,y,, and therefore divides x, 
because it cannot divide x,, y, and z,. Suppose, for example, that p/y, 
(an analogous argument will apply if piz,). Then Ps =:(0; ¥i Vos Vi Zs) = 
(0, ¥; ¥25 ¥2Z,) = (0, ¥,, Z,) = P, (where we have used the fact that p 
divides y,Z. — 2Z,). - 

(ii) p does not divide x,. Then P, = (%1X2, X12, X172) = (X,X2, X2),, 
X271) = (%1,9,,7,) =P 

Conversely, suppose that P, = P,. Without loss of generality, suppose 
that p/x, (an analogous argument will apply if p/y, or p}z,). Then, since 
Pi ee (X2, 92,72), we also have p/x,. Hence, (XX, ¥, V2, XZ.) = 
Pee = (¥,X1, ¥2Y,, X27Z,). Since the first coordinates are the same, these 
two hare can be equal only if the second and third coordinates are equal, 
ie., if p divides x, y. — x,y, and x,z, — X»Z,. Finally, we must show that p 
divides y,z, — y2z,. If both y, and z, are divisible by p, then this is trivial. 
Otherwise, the conclusion will follow by repeating the above argument with 
x1, xX, replaced by y,, y, or by z,, Zz. This concludes the proof of the lemma. 

We are now ready to prove Proposition 17. Suppose that the proposition 
is false, i.e., that E,(@) contains a point of finite order greater than 2. Then 
either it contains an element of odd order, or else the group of points of 
order 4 (or a divisor of 4) contains either 8 or 16 elements. In either case we 
have a subgroup S = {P,, P,, ..., P,} © E,(Q)iors, Where m = #S is either 
8 or else an odd number. 

Let us write all of the points P,, i= 1, ..., m, in the form in the lemma: 
P; = (X;, V;, Z,). For each pair of points P;, P,, consider the cross-product 
vector (4:2) — Yj2is Xj2i — XiZjn Xj — x;y,)€R*. Since P, and P, are distinct 
points, as vectors in Rs ‘hey are not proportional, and so their cross-product 
is not the zero vector. Let n, be the greatest common divisor of the coor- 
dinates of this cross-product. According to the lemma, the points P, and P, 
have the same image P, = Pp in E,(F,) if and only if p divides n,;. Thus, if p 
is a prime of good reduction which is greater than all of the ny, it follows 
that all images are distinct, i.e., the map reduction modulo p gives an injection 
of Sin E,(F,). 

But this means that for all but finitely many p the number m must divide 
#£E,(F,), because the image of S is a subgroup of order m. Then for all but 
finitely many primes congruent to 3 modulo 4, by Proposition 16 we must 
have p = —1 (mod m). But this contradicts Dirichlet’s theorem on primes 
in an arithmetic progression. Namely, if m = 8 this would mean that there 
are only finitely many primes of the form 8k + 3. If m is odd, it would 
mean that there are only finitely many primes of the form 4mk + 3 (if 3/m), 
and that there are only finitely many primes of the form 12k + 7 if 3|m. In 
all cases, Dirichlet’s theorem tells us that there are infinitely many primes of 
the given type. This concludes the proof of Proposition 17. oO 
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Notice how the technique of reduction modulo p (more precisely, the use 
of Proposition 16 for infinitely many primes p) led to a rather painless proof 
of a strong fact: There are no ‘‘non-obvious”’ rational points of finite order 
on E,. As we shall soon see, this fact is useful for the congruent number 
problem. But a far more interesting and difficult question is the existence 
of points of infinite order, i.e., whether the rank r of E,(Q) is nonzero. As 
we shall see in a moment, that question is actually equivalent to the question 
of whether or not 7 is a congruent number. 

So it is natural to ask whether mod p information can somehow be put 
together to yield information about the rank of an elliptic curve. This 
subtle question will lead us in later chapters to consideration of the Birch— 
Swinnerton—Dyer conjecture for elliptic curves. 

For further general motivational discussion of elliptic curves over finite 
fields, see [Koblitz 1982]. 

We now prove the promised corollary of Proposition 17. 


Proposition 18. 1 is a congruent number if and only if E,(Q) has nonzero 
rank r. 


ProorF. First suppose that n is a congruent number. At the beginning of §2, 
we saw that the existence of a right triangle with rational sides and area n 
leads to a rational point on E, whose x-coordinate lies in (Q*)*. Since the 
x-coordinates of the three nontrivial points of order 2 are 0, +n, this means 
that there must be a rational point not of order 2. By Proposition 17, such a 
point has infinite order, 1.e., 7 > 1. 

Conversely, suppose that P is a point of infinite order. By Problem 2(c) 
of §1.7, the x-coordinate of the point 2P is the square of a rational number 
having even denominator. Now by Proposition 2 in §I.2, the point 2P 
corresponds to a right triangle with rational sides and area n (under the 
correspondence in Proposition 1). This proves Proposition 18. Oo 


Notice the role of Proposition 17 in the proof of Proposition 18. It tells 
us that the only way to get nontrivial rational points of the form 2P is from 
points of infinite order. Let 2E,(Q) denote the subgroup of £,,(Q) consisting 
of the doubles of rational points. Then Proposition 17 is equivalent to the 
assertion that 2E,(Q) is a torsion-free abelian group, i.e., it is isomorphic 
to acertain number of copies (namely, r) of Z. The set 2E,(Q@) — 0 (0 denotes 
the point at infinity) is empty if and only if r = 0. 

We saw that points in the set 2E,(Q) — 0 lead to right triangles with 
rational sides and area n under the correspondence in Proposition 1. It is 
natural to ask whether all points meeting the conditions in Proposition 2, 
i.e., corresponding to triangles, are doubles of points. We now prove that 
the answer is yes. At the same time, we give another verification of Proposi- 
tion 18 (not relying on the homework problem 2(c) of §I1.7). 


Proposition 19. There is a one-to-one correspondence between right triangles 
with rational sides X < Y < Z and area n, and pairs of points (x, +y)é 
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2E,,(Q) — 0. The correspondence is: 


(x, ty) x tn — Jx—n, Jx tnt [x —n, 2/x; 


X, Y, Z(Z2/4, +(¥? — X2)Z/8). 


In light of Proposition | of §I.1, Proposition 19 is an immediate conse- 
quence of the following general characterization of the doubles of points on 
elliptic curves. 


Proposition 20. Let E be the elliptic curve y? = (x — e,)(x — e,)(x — e3) with 
€1,€7,€3€Q. Let P = (Xo, Wo) € E(Q) — 0. Then Pe 2E(Q) — 0 if and only if 
Xo — €y, Xo — 2, Xo — €3 are all squares of rational numbers. 


PRooF. We first note that, without loss of generality, we may assume that 
X 9 = 0. To see this, make the change of variables x’ = x — xg. By simply 
translating the geometrical picture for adding points, we see that the point 
P’ =(0, yo) on the curve E’ with equation y? = (x — e4)(x — e3)(x — e3), 
where e; = e; — Xo, is in 2E’(Q) — 0 if and only if our original P were in 
2E(Q) — 0. And trivially, the x) — e; are all squares if and only if the (0 — e/) 
are. So it suffices to prove the proposition with x, = 0. 

Next, note that if there exists Qe E(Q) such that 2Q =: P, then there are 
exactly four such points Q, Q,, Q,, Q,¢ E(Q) with 2Q, = P. To obtain Q,, 
simply add to Q the point of order two (e;, 0) € E(Q) (see Problem 5 in §I.7). 

Choose a point Q = (x, y) such that 2Q = P = (0, y,). We want to find 
conditions for the coordinates of one such Q (and hence all four) to be 
rational. Now a point Q on the elliptic curve satisfies 20 = P if and only if 
the tangent line to the curve at Q passes through — P = (0, —y,). That is, 
the four possible points Q are obtained geometrically by drawing the four 
distinct lines emanating from — P which are tangent to the curve. 

We readily verify that the coordinates (x, y) are rational if and only if the 
slope of the line from —P to Q is rational. The “only if” is immediate. 
Conversely, if this slope m is rational, then the x-coordinate of Q, which is 
the double root of the cubic (mx — yo)? = (x — e,)(x — e,)(x — e3), must 
also be rational. (Explicitly, x = (e, +e, +e, +m7’)/2.) In this case the 
y-coordinate of Q is also rational: y = mx — yo. Thus, we want to know 
when one (and hence all four) slopes of lines from — P which are tangent to 
EF are rational. 

A number meéC is the slope of a line from — P which is tangent to E if 
and only if the following equation has a double root: 


(mx — Yo)? = (x — ey)(x — ey) (x —e3) =x? +.ax? + bx +c, (9.1) 
with 


a= —e,—e,—63, b=e,e, + ee, + €3e3, C= —€10,€, = YZ, 
(9.2) 


where the last equality c = yg comes from the fact that (0, yo) is on the curve 
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y? =x? + ax? + bx +c. Now if we simplify (9.1) and factor out x, our 
condition becomes: the following quadratic equation has a double root: 


x? + (a—m?’)x + (b + 2my) = 0. 
This is equivalent to saying that its discriminant must vanish, i.e., 
(a — m’)* — 4(6 + 2my,) = 0. (9.3) 


Thus, our task is to determine when one (and hence all four) roots of this 
quartic polynomial in m are rational. 

We want to find a condition in terms of the e,’s (namely, our claim is that 
an equivalent condition is: —e;¢Q7). In (9.3), the a and b are symmetric 
polynomials in the e;, but the yp is not. However, yo is a symmetric polyno- 
mial in the Je; . That is, we introduce f, satisfying f;* = —e;. There are two 
possible choices for f,, unless e; = 0. Choose the f; in any of the possible 
ways, subject to the condition that yo =f, 2/3. If all of the e; are nonzero, 
this means that the sign of f, and f, are arbitrary, and then the sign of f, is 
chosen so that yy and f, f,f; are the same square root of —e,é2é3. If, say, 
e, = 0, then either choice can be made for the sign of f,, f,, and of course 
fy = 0. In all cases there are four possible choices of the /;’s consistent with 
the requirement that yo = f, 4/3. Once we fix one such choice f,, fi, fs, we 
can list the four choices as follows (here we’re supposing that e, and e, are 
nonzero): 


hihi fs; tis —h, has fis dos a5 Tus =fr, Fy. (9.4) 


The advantage of going from the e,’s to the f,’s is that now the coefficients 
of our equation (9.3) are symmetric functions of /,, f2, f;. More precisely, 


if we set, =f[p +h th, 2=hhthht+hh» 83 =hhss, the elemen- 
tary symmetric functions, then 


a=fP the +fP = sj — 252; 
b = f2p? + S768 + SLAF = 83 — 25153; 
Yo = 83: 
Thus, equation (9.3) becomes 
0 = (m? — s? + 2s,)? — 4(53 — 25,53 + 2ms3) 
1 2 2 1i°3 3 (9.5) 


= (m? — s?)? + 4s,(m? — sf) — 853(m — 5,). 


We see at a glance that the polynomial in (9.5) is divisible by m — 5,, i.e., 
m=s,=f,+f+/f; is a root. Since we could have made three other 
choices for the signs of the f, the other roots must correspond to these 
choices, i.e., the four solutions of equation (9.3) are: 


m=fhtht+h, m,=fi—-h—hs 
m= -fth—-h, m= fh-hth- 


(9.6) 
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We want to know whether the four values in (9.6) are rational. Clearly, 
if all of the f; are rational, then so are the m;. Conversely, suppose the m; 
are rational. Then /, = (m, + m,)/2, fp = (m, + m3)/2, and fy = (m, + m,)/2 
are rational. The conclusion of this string of equivalent conditions is: the 
coordinates (x, y) of a point Q for which 2Q = P are rational if and only 
if the f, = ./ —e; are rational. This proves Proposition 20. oO 


Finally, we note that Proposition 20 holds with Q replaced by any field 
K not of characteristic 2. Essentially the same proof applies. (We need only 
take care to use algebraic rather than geometric arguments, for example, 
when reducing to the case P = (0, yo).) 


PROBLEMS 


1. Prove that for fodd, any F,s-point of order 3 on the elliptic curve E,: y? = x? — n?x 


is actually an F,-point; prove that there are at most three such points if p = 3 
(mod 4); and find a fairly good sufficient condition on p and f which ensures nine 
F,¢-points of order 3. 


2. For each of the following values of q, find the order and type of the group of 
F,-points on the elliptic curve E, : y? = x? — x. In all cases, find the type directly, 
if necessary checking how many points have order 3 or 4. Don’t “peek” at the 
later problems. 

(a) All odd primes from 3 to 23. 
(b) 9 

(c) 27 

(d) 71 

(e) 113. 


3. Find the type of the group of F,-points on the elliptic curve E,: y? = x* — 25x 
for all odd primes p of good reduction up to 23. 


4. Prove that for nonzero ae Q the equation y? = x° — adetermines an elliptic curve 
over any field K whose characteristic p does not divide 6 or the numerator or 
denominator of a; and that it has qg + 1 F,-points if q = 2 (mod 3). 


5. Prove that there are exactly 3 F,-points of order 3 on the elliptic curve in Problem 4 
if ¢g = 2 (mod 3). 


6. For all odd primes p from S to 23, find the order and type of the group of F,-points 
on the elliptic curve y? = x3 — 1. 


7. Prove that the torsion subgroup of the group of Q-points on the elliptic curve 
y? = x? — ahas order dividing 6 and that its order is equal to: 
(a) 6if a= —b® for some be Q; 
(b) 2 if a=c? for some ce Q with c not of the form —d?; 
(c) 3 if either a= —d? for some deQ with d not of the form 5, or if a = 432b° 
for some be Q; 
(d) 1 otherwise. 


8. Show that the correspondence constructed in Problem 2 of §I.2 gives a one-to-one 
correspondence between right triangles as in Proposition 19 and pairs +P of 
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non-identity elements of the quotient group E,(Q)/E,(Q)torsions Which is isomorphic 
to 2E,(Q) under the map Pt>2P. See Problem 2(b) of §I.7. 


In the problems below, we illustrate how more information can be obtained 
using two additional tools: (1) the complex multiplication automorphism 
(x, y)>(—x, ./—1y) of the group of K-points of the elliptic curve ps 


x? 


—n?’x if K contains a square root of —1; (2) the action of Gal(K*®°!/K) 


on the coordinates of the K*'**!-points. 


9. 


10. 


11. 


12. 


13. 


Suppose that g = 3 (mod 4), and / is an odd prime. Prove that: 

(a) there are at most / F,-points of order / on the elliptic curve y? = x3 — nx, 
and there are at most eight F,-points of order 4; 

(b) the group of F,-points is the product of a group of order 2 and a cyclic group 
of order (¢ + 1)/2. 


Suppose that g = 2 (mod 3), 2/.N, 3N. Prove that there are at most N F,-points 
of order N on the elliptic curve y? = x* — a. 


Suppose that g = 1 (mod 4), and /=3 (mod 4) is a prime not equal to p. Let 
(7, 1°) be the /-part of the type of the group of F,-points on the elliptic curve y? = 
x3 —n*x. Prove that « = B. If /= 2, prove thata = fBora=fPtl. 


The group of K-points on an elliptic curve is analogous to the multiplicative group 

K*. In Problem 11 of §1.7, we saw that for K = C, as a 0 the elliptic curve y= 

(x? — a)(x + 1) “becomes” the multiplicative group C*. Now let K be the finite 

field F,. In this problem we work with K*, and in the next problem we work with 

the group of K-points on an elliptic curve. Let / be a prime not equal to p, and 
suppose that F, contains all /-th roots of 1, 1.e., q = p! = 1 (mod /). 

(a) Show that the splitting field of x'— a, where aeF,, has degree either 1 or I 
over F,. 

(b) Show that the subfield of F2!**! generated by all /”*!-th roots of | is Fy", 
where MW’ < M. 

(c) (For readers who know about adic numbers.) Construct an isomorphism 
between the additive group Z, of /-adic integers and the galois group over F, 
of the field extension generated by all /-th power division points (i.e., /-th power 
roots of unity). 


Now let E be an elliptic curve defined over F,. Suppose that there are / ? F,-points 

of order /. 

(a) Let A be an F,-point, and let F,, be the extension of F, generated by the co- 
ordinates of a solution « to the equation /x = A (i.e., F,r is the smallest extension 
of F, containing such an a). Show that there are / ? Fyr-points a; such that 
la; = A. 

(b) Fix an F,r-point « such that /x = A. Prove that the map ot o(a) — gives an 
imbedding of Gal(F,+/F,) into the group of points of order / on E. 

(c) Show that r= 1 or /. 

(d) What is the field extension of F, generated by all points of order IM M=1, 
2, ...? What is its galois group? 


CHAPTER II 


The Hasse—Weil Z-Function of an 
Elliptic Curve 


At the end of the last chapter, we used reduction modulo p to find some 
useful information about the elliptic curves E,: y? = x? -- n?x and the con- 
gruent number problem. We considered E, as a curve over the prime field 
F, where p}2n; used the easily proved equality #£E,(F,) =p +1 when 
p =3 (mod 4); and, by making use of infinitely many such p, were able 
to conclude that the only rational points of finite order on E, are the four 
obvious points of order two. This then reduced the congruent number 
problem to the determination of whether r, the rank of E,(Q), is zero or 
greater than zero. 

Determining r is much more difficult than finding the torsion group. Some 
progress can be made using the number of F,-points. But the progress does 
not come cheaply. First of all, we will derive a formula for #E,(F,) for any 
prime power q =p’. Next, we will combine these numbers N, = Np = 
#£,(F,r) into a function which is analogous to the Riemann zeta-function 
(but more complicated). The behavior of this complex-analytic function 
near the point | is intimately related to the group of rational points. 

Before introducing this complex-analytic function, which is defined using 
all of the N,_,, we introduce a much simpler function, called the “congruence 


r,p? 
zeta-function”, which is built up from the N, = N,,, for a fixed prime p. 


§1. The congruence zeta-function 


Given any sequence N,, r = 1, 2, 3, ..., we define the corresponding “‘zeta- 
function” by the formal power series 


foe) T’ 00 k 
Z(T) = exp (x NT). where exp(w) = eae (1.1) 
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At first glance, it might seem simpler to define Z(T) as = N,T"; however, 
the above definition has crucial properties which make it the most useful 
one (see the problems below). 

Let K be a field. Let AX denote the set of m-tuples of elements of K. 
By an “affine algebraic variety in m-dimensional space over K”’ we mean 
a system of polynomial equations of the form f(x,, ..., X,,) = 0, where 
Sie Kx, ..., XJ. For example, a conic section is a system of two equations 


Ay =x +y?-2=0;  flx,y,2) =axtby+er+d=0 


in 3-dimensional space over R. If Lis any field extension of K, the ‘‘L-points” 
of the variety are the m-tuples (x,, ..., X,,)€A7 for which all of the poly- 
nomials f; vanish. 

By a “projective variety in m-dimensional space over K”’ we mean a system 
of homogeneous polynomial equations f,(xo, X,, ..-, X»)inm + 1 variables. 
If L is a field extension of K, the ““L-points”’ of the projective variety are the 
points in P7 (i.e., equivalence classes of m + 1-tuples (xg, ..., Xj), where 
(X9, + +5 Xm) ~ (AXq, +++ AXm), 2€L*) at which all of the f vanish. For 
example, in the last chapter we studied the F,-points of the elliptic curve 
defined in P?. by the single equation f(x, y, z) = y?z — x3 + n?xz? =0. 
(Note: Here Xo = 2Z,X, =X, X,=y are variables for a projective variety 
in PZ, while in the last paragraph x, = x, x) = y, x3 = Z were variables for 
an affine variety in Aj.) 

If we have a projective variety, by setting x) = 1 in the f; we obtain an 
affine variety whose L-points correspond to the m + 1-tuples with nonzero 
first coordinate. The remaining L-points of the projective variety will be 
the projective variety in PZ”! obtained by setting x, = 0 in all of the equa- 
tions and considering the equivalence classes of m-tuples (x,, ..., X,) which 
satisfy the resulting equations. For example, the elliptic curve with equation 
y?z — x3 + n?xz* consists of the affine points—the solutions of y? = x* — 
n?x—and the points (x, y) of Pk for which —x* = 0, i.e., the single point 
(0, 1) on the line at infinity z = 0. 

Let V be an affine or projective variety defined over F,. For any field 
K>F,, we let V(K) denote the set of K-points of V. By the “congruence 
zeta-function of V over F,”’ we mean the zeta-function corresponding to the 
sequence N, = #V(F,r). That is, we define 


Z(V/F,; T) = exp b #V(E,») rr) (1.2) 
r=1 


Of course, WN, is finite, in fact, less than the total number of points in AF . 
(in the affine case) or PE _(in the projective case). 

We shall be especially interested in the situation when V is an elliptic 
curve defined over F,. This is a special case of a smooth projective plane 
curve. A projective plane curve defined over a field K is a projective variety 
given in PZ by one homogeneous equation f(x, y, z) = 0. Such a curve is 
said to be “‘smooth”’ if there is no K*'®*'-point at which all partial derivatives 
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vanish. This agrees with the usual definition when K = C (‘thas a tangent 
line at every point’’). 

It turns out that the congruence zeta-function of any elliptic curve E 
defined over F, has the form 

— 2a,T + qT? 


Z(E/F,; T) = l 


1.3 
(l= T)0 — 47)’ Co 


where only the integer 2a, depends on E. We shall soon prove this in the 
case of the elliptic curve E,,: y? = x? — n?x. Let « be a reciprocal root of 
the numerator; then 1 — 24,7 + qT? = (1 — aT)(1 — 27). If one takes the 
logarithmic derivative of both sides of (1.3) and uses the definition (1.1), 
one easily finds (see problems below) that the equality (1.3) is equivalent 
to the following formula for N, = #E(F,r): 


N,=q' + 1— a" — (q/a)’. (1.4) 
As a special case of (1.4) we have 
N, = #E,)=q+1—a—4=9 +1 —2ag. (1.5) 


Thus, if we know that Z(E/F,; T) must have the form (1.3), then we can 
determine a; merely by counting the number of F,-points. This will give us 
Z(E/F,; T), the value of «, and all of the values N, = #E(Far) by (1.4). 
In other words, in the case of an elliptic curve, the number of F,-points 
determines the number of F,,-points for all r. This is an important property 
of elliptic curves defined over finite fields. We shall prove it in the special 
case y? = x? — n*x. 

It will also turn out that « is a quadratic imaginary algebraic integer whose 
complex absolute value is ,/q. In the case y? = x* — n?x, it will turn out 
that « is a square root of —g if ¢ = 3 (mod 4), and is of the form a + bi, 
a, beZ,a? + b* =q, if gq = 1 (mod 4). 

This situation is a special case of a much more general fact concerning 
smooth projective algebraic varieties over finite fields. The general result 
was conjectured by André Weil in [Weil 1949], and the last and most 
difficult part was proved by Pierre Deligne in 1973. (For a survey of Deligne’s 
proof, see [Katz 1976a].) We shall not discuss it, except to state what it says 
in the case of a smooth projective curve (one-dimensional variety): 


(i) Z(V/F,; T) is a rational function of T (this is true for any variety 
without the smoothness assumption) which for a smooth curve has the 
form P(T)/(1 — T)(1 — qT). Here P(T) has coefficients in Z and con- 
stant term | (equivalently, its reciprocal roots are algebraic integers). 

(ii) If V was obtained by reducing modulo p a variety V defined over Q, 
then deg P = 2g is twice the genus (“Betti number’’) of the complex 
analytic manifold V. Intuitively, g is the “number of handles” in the 
corresponding Riemann surface. An elliptic curve has g = 1, and the 
Riemann surface in Fig. II.1 has g = 3. 
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Figure II.1 


(iii) If w is a reciprocal root of the numerator, then so is g/«. 
(iv) All reciprocal roots of the numerator have complex absolute value Vq- 


One reason for the elegance of the Weil conjectures is the intriguing 
indirect connection between the ‘‘physical’’ properties of a curve (e.g., its 
number of handles as a Riemann surface when considered over C) and the 
number theoretic properties (its number of points when considered over 
F,r). Roughly speaking, it says that the more complicated the curve is (the 
higher its genus), the more N,’s you need to know before the remaining 
ones can be determined. In the simplest interesting case, that of elliptic 
curves, where g = 1, all of the N,’s are determined once you know N,. 


PROBLEMS 


I. 


. Show that if NV, -| 


Show that if N, = N* + N** and Z(T), Z*(T), Z**(7) are the corresponding 
zeta-functions, then Z(T) = Z*(T):Z**(7); and if N, = N,* — N-**, then Z(7) = 
Z*(T)/Z**(T). 


. Show that if there exists a fixed set x,, ...,4,, 8,, ..., 6, such that for all r we have 
N= Bi +++ + Bf — af — --- — af, then 
27) = ba DU = 27) = 467) 
(1 —B,T)C — B,T)---d — £7) 
. Prove that if |N,| < CA’ for some constants C and A, then the power series Z(T) 


converges in the open disc of radius 1/A in the complex plane. 


1, reve 


n; : f 
5 then Z(7) is not a rational function; but if NV, = 


>. ; 


2, reven; : : : 
: dd then Z(T) is rational. In the latter case, interpret N, as the number 
>» Fr odd, 


of F,.-solutions of some equation. 


. The Bernoulli polynomials B,(x)¢Q[x] have the properties: (i) deg B, = r; (ii) for 


all M, B,(M) — BO) = r(1! + 2771 4 --- +(M — 1)'). Now for fixed M let 
N,_, = +(B,(M) — B,(0)). Find the corresponding Z(7). (Cultural note: B,(x) = 
x — 1, B,(x) = x? —x +4, etc.: they are uniquely determined by properties (i) 
and (ii) along with the normalization requirement that {j B,(x)dx = 0 for r > 1. 
One way to define them is by equating terms in the relation: re%/(e' — 1) = 
Lo B(x) /r!.) 
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6. 


10. 


Il. 


14. 


Suppose that / is a prime, qg is a power of another prime p, g = | (mod /), gq #1 

(mod /*). 

(a) For fixed M, let N, = # {xe Far|xt™ = 1}. Find the corresponding Z(T7). 

(b) Now let N, = # {x € Fyr|x = 1 for some M}. Find the corresponding zeta- 
function. Is it rational? 


. A special case of an affine or projective variety V is the entire space, corresponding 


to the empty set of equations. Let AZ denote m-dimensional affine space (the 

usual space of m-tuples of numbers in the field K), and let P% denote projective 

space, as usual. 

(a) What is ZAG Fa; T)? 

(b) Find Z(PE,/F a; T) by writing P% as a disjoint union of Ak, k =m, m—1, 
..., 0, and using Problem 1. 

(c) Also find Z(PE, /F,; T) by counting equivalence classes of (m + 1)-tuples, 
and check that your answers agree. 


. Show that, if Vis a variety in Af, or P7,, then Z(V/E, ; T) converges for |T|<q™. 


. If one wants to prove that Z(V/EF,; T)¢Z[[T]] with constant term | for any affine 


or projective variety V, show that it suffices to prove this when V is any affine 
variety. Then show that it suffices to prove this when V is given by a single equation. 
Show that the rationality assertion Z(V/F,; T)¢Q(T) can also be reduced to the 
case of an affine variety V defined by a single equation. A variety defined by a 
single equation is called a “hypersurface’’. 


Find the zeta-function of the curve y? = x? — n?x in PE, if p|2n, ie., p is not a 
prime of good reduction. 


Find the zeta-function of the hypersurface in AG, defined by x,x, — x3x4 =0. 


. Let N, be the number of lines in P? ,. Find its zeta-function. (It is possible to view 
q 


the set of k-dimensional subspaces in PX as a variety, called the grassmannian; 
in our case k = 1, m = 3.) 


. Using the form (1.3) for the zeta-function of an elliptic curve, where the numerator 


has reciprocal root a, show that N, is equal to the norm of 1 — «”. Now, in the 
situation of Problem 13 of §1.9, suppose that E has /? F,-points of order /, and no 
F,-points of exact order /?. Prove that the field extension of F, generated by the 
coordinates of the points of order /“*? is F,!“. (Note the close analogy with the 
multiplicative group F*, with g = 1 (mod /) but q # 1 (mod /”), where the field 
generated by all /“*!-th roots of unity is Fy.) 


Let V be an affine algebraic variety defined over K by equations f,(x,, ..., X,) = 

0. By the coordinate ring R(V) we mean the quotient ring of K[x,,..-, Xml 

by the ideal generated by all of the fj. Let P = (a,, ..., 4,,) be a K"8*!_point on 

V. Let L = K(a,, ..., @,,) be the finite extension of K generated by the coordinates 

of P. L is called the residue field of P, and its degree over K is called the residue 

degree. 

(a) Show that the map x;+~a; is well-defined on R(V), and extends to a homo- 
morphism whose kernel is a maximal ideal m(P) in R(V). (It is not hard to 
prove that every maximal ideal of R(V) arises in this way.) 

(b) Show that m(P’) = m(P) if and only if there is an isomorphism from L to L’ 
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(the residue fields of P and P’, respectively) which takes a; to a;. Thus, the 
maximal ideal m(P) corresponds to d different K***!-points P on V, where 
d=[R(V)/m(P): K ] is the residue degree of any of the points P. 


15. In the situation of Problem 14, let K = F,. For a given K*®‘'-point P, the residue 
field is F,4 for some d. Then P contributes | to each N, for which r is a multiple of 
d. That is, the contribution of P to the exponent in the definition of the zeta- 
function is LP, T/kd. Then Z(V/F,; T) is exp of the sum of all contributions 
from the different K**°'-points P. Group together all points corresponding to a 
given maximal ideal, and express Z(V/F,; T) as the product over all maximal 
ideals m of (1 — T*®™)~!. Then show that the zeta-function belongs to 1 + 
TZ{[[T]]. (Cultural note: If we make the change of variables T = q~*, and define 
Norm(m) to be the number of elements in the residue field, i.e., Norm(m) = q*°8", 
then we have Z(V/F,;q °)=,(1 — Norm(m)~‘)~', which is closely analogous 
to the Euler product for the Dedekind zeta-function of a number field: (,(s) = 
T1,(1 — Norm(p)~‘)~', in which the product is over all nonzero prime ideals of 
the ring of integers in the field K. In a number ring, a nonzero prime ideal is the 
same as a maximal ideal.) 


16. Prove that if Z(V/F,; T)¢€Q(7), then the numerator and denominator are in 
1+ TZ[T ] (equivalently, the as and f’s in Problem 2 are algebraic integers). 


§2. The zeta-function of E,, 


We now return to our elliptic curve E,, which is the curve y? = x3 — nx, 
where n is a squarefree positive integer. More precisely, E,, is the projective 
completion of this curve, i.e., we also include the point at infinity. E,, is 
an elliptic curve over any field K whose characteristic does not divide 2n, 
and, as we have seen, it is sometimes useful to take K = F,, or more generally 
K=F,. The purpose of this section is to express the number of F,-points 
on E, in terms of “‘Jacobi sums”’. 

To do this, we first transform the equation of E, to a “diagonal form”. 
We say that a hypersurface f(x,, ..., X,) =0 in Aj is “diagonal” if each 
monomial in f involves at most one of the variables, and each variable occurs 
in at most one monomial. For example, the ‘‘Fermat curve” x“ + y4 = 1 is 
diagonal. It turns out that diagonal hypersurfaces lend themselves to easy 
computation of the N, (much in the same way that multiple integrals are 
much easier to evaluate when the variables separate). We shall not treat the 
general case, but only the one we need to evaluate N, = #£,(F,r). (For a 
general treatment of diagonal hypersurfaces, see [Weil 1949] or [Ireland 
and Rosen 1990, Chapter 11].) 

We first show a relation between points on E,: y? = x3 — n?x and points 
on the curve E”: u? = v+ + 4n?. As usual, we suppose that p / 2n. First suppose 
that (u, v) is on E’. Then it is easy to check that the point (x, y) = (G(u + v7), 
ty(u + v?)) is on E,. Conversely, if (x, y) is on £, and its x-coordinate is 
nonzero, then we check that the point (uw, v) = (2x — y?/x?, y/x) is on E;. 
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Moreover, these two maps are inverse to one another. In other words, we 
have a one-to-one correspondence between points on F, and points on 
E,, — {(0, 0)}. Let N’ be the number of F,-solutions (u, v) to u? = v* + 4n?. 
Then the points on our elliptic curve consist of (0, 0), the point at infinity, 
and the N’ points corresponding to the pairs (u, v). In other words, N, = 
#E,(F,) is equal to N’ + 2. So it remains to compute V’. The advantage 
of the equation u? = v* + 4n? is that it is diagonal. 

The basic ingredients in determining the number of points on a diagonal 
hypersurface are the Gauss and Jacobi sums over finite fields. We shall now 
define them and give their elementary properties. 

Let w: F, > C* be a nontrivial additive character, i.e., a nontrivial homo- 
morphism from the additive group of the finite field to the multiplicative 
group of complex numbers. (Since F, is finite, the image must consist of 
roots of unity.) In what follows, we shall always define w(x) = €1*, where 
€=e?"?, and Tr is the trace from F, to F,. Since the trace is a nontrivial 
additive map, and its image is F, = Z/pZ, we obtain in this way a nontrivial 
additive character. 

Now let x: Ff > C* be any multiplicative character, i.e., a group homo- 
morphism from the multiplicative group of the finite field to the multi- 
plicative group of nonzero complex numbers. In what follows, the additive 
character W will be fixed, as defined above, but y can vary. 

We define the Gauss sum (depending on the variable x) by the formula 


gM = Y x0OW) 
xe Fg 
(where we agree to take 7(0) = 0 for all y, even the trivial multiplicative 
character). We define the Jacobi sum (depending on two variable multi- 
plicative characters) by the formula 


Ia, 2) = > X1(*)x201 — x). 
xe q 

The proofs of the following elementary properties of Gauss and Jacobi 
sums are straightforward, and will be left as exercises. (Here y,,;, denotes 
the trivial character, which takes all nonzero elements of F, to 1; x, 7,, and 
%2 denote nontrivial characters; and ¥ denotes the complex conjugate (also 
called “‘inverse’’) character of x, whose value at x is the complex conjugate 
of 7(x).) 


CQ) 9Gariw) = —13 IOtive Keri) =F -— 23 Ierivs Y= 1; 
IG 0 = —K- Ds JMG, %2) =I 1); 

(22) go: 9M =x(—-Na; lg@O| = Va: 

(3) JQ, t2) =9KdIGVIN 2) if %FU- 


We now proceed to the computation of the number N’ of u, ve F, satisfying 
u? = v* + 4n?. The key observation in computing N’ is that for any a 4 0 
in F, and any m dividing g — 1, the number of solutions x¢é F, to the equation 
x” = a is given by: 
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Hix" Sal= ¥: x(a), (2.1) 
xm 

where the sum is over all multiplicative characters whose m-th power is the 
trivial character. Namely, both sides of (2.1) equal m if a is an m-th power 
in F, and equal 0 otherwise; the detailed proof will be left as a problem 
below. 

By Proposition 16 of the last chapter, we know that N, = q+ lifq=3 
(mod 4). In what follows, we shall suppose that g = 1 (mod 4). 

In counting the pairs (u, v), we count separately the pairs where either wu 
or v is zero. Thus, we write 


N’ = #{ueF,|u? = 4n?} + # {ve F,|0 = v* + 4n?} 


(2.2 
+ # {u, veF*|u? = v* + 4n7}. 


The first term in (2.2) is obviously 2 (recall that we are assuming that p}2n). 
We use (2.1) to evaluate the second term. Let y, be one of the characters 
of F* having exact order 4, 1.e., y4(g) = i for some generator g of the cyclic 
group F¥. Then, by (2.1), the second term in (2.2) equals 


y 44(—4n?) = 2 + 244(—4n’) (2.3) 
j=l 


(where we use the fact that —4n? is a square in Fs). Finally, we evaluate the 
third term in (2.2). Let y, denote the nontrivial character of order 2 (1.e., 
% = x4). Using (2.1) again, we can write the third term in (2.2) as 

Y #waa eet =F}= YY P@x@—an’). 


a,beF acFg,a—4n7#0 j=1,2,3,4 
a=b+4n2 k=1,2 


Note that since 7j(0) = 0, we can drop the condition a — 4n? 4 0 on the 
right. We now make the change of variable x = a/4n? in the first summation 
on the right. As a result, after we reverse the order of summation, the right 
side becomes 

DY xh(—4n?) Y Boomd-—xN= Ye x4(—4n?)J(x2, x2)- 


J=1,2,3,4 xeFg J=1,2,3,4 
1,2 k=1,2 


Finally, bringing together the three terms in (2.2) and using property (1) of 
Jacobi sums when y‘ or yj is trivial or they are conjugate to one another, 
we obtain: 


N’=4 +4 244(—4n?) + Yo xh(—4n?) (a2, 14) $9 — 243-(-D 
j=1,3 


+ 2y4(—4n?) -(—1) (2.4) 
= q—14 7%4(—4n’) (J (Xa, Xa) + S(X2, Ha))- 


In the problems we show that x,(—4) = 1. Hence, 74(—4n7) = x(n). Thus, 
if we set 


a= nig def —nMJ(D; Xa), (2.5) 
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we conclude that 
N, = #£,(F,) =qtl—a-—@. (2.6) 


Notice that « is an algebraic integer in Q(Z), since the values of x, and 74 
in the definition of J(z,, 74) are all +1, +7. We now pin down the Gaussian 
integer a = a + bi, at least in the case when g = p= is a prime congruent to 1 
mod 4 or q = p’” is the square of a prime congruent to 3 mod 4. By property 
(3) relating Jacobi to Gauss sums, we have 


a= —72(1)g(X2)9(Xa)/9(Ka), 


and hence, by property (2), we have |a|? = a? + b? =q. In the two cases 
gq =p = | (mod 4) and q = p’, p = 3 (mod 4), there are very few possibilities 
for such an «. Namely, in the former case there are eight choices of the form 
tat bi, +b + ai; and in the latter case there are the four possibilities +p, 
+pi. The following lemma enables us to determine which it is. 


Lemma 1. Let q = | (mod 4), and let x, and x, be characters of F%, of exact 
order 2 and 4, respectively. Then 1 + J(x2, 4) is divisible by 2 + 2i in the 


ring Z[i]. 


Proor. We first relate J(z,, 74) to J(%4; %4) by expressing both in terms of 
Gauss sums. By property (3), we have: (x2, %4) = J(%a» %4)9(X2)"/9(Xa)9 (Ha) 
= x4(—1)J (4; Xa) by property (2). Next, we write 


Ita, ta) = Vita ad — x) = EF) + 2 axa — »), 


where &’ is a sum over (q — 3)/2 elements, one from each pair x, 1 — x, 
with the pair 85", “5 omitted. Notice that y,(x) is a power of i, and so is 
congruent to | modulo | + iin Z[i]; thus, 2y,(x)y¥,(1 — x) = 2(mod 2 + 2i). 
Asa result, working modulo 2 + 2, we have J(74, 74) =9 —3 + 73(5% = 
2 + 7~4(4) (since g = | (mod 4)). Returning to J(z,, 74), we obtain: 


1+ S(x2, Xa) = 1 + Xa(— DI (Xa, Xa) = 1 + Xa(—4) + 244(-D) 
(mod 2 + 2i). 


Since ¥,(—4) = 1, as mentioned above (and proved in the problems below), 
and since 2(1 + y4(—1)) = 0 or 4, it follows that 1 + J(z), x4) is divisible by 
2 + 2i, as claimed. Oo 


We now have the basic ingredients to prove a formula for Z(E,/F,; T). 


Theorem. Let E,, be the elliptic curve y? = x°* — n*x defined over F,, where 
pi2n. Then 

1—2aT+pT? _(1—aT)(1—27) 
ha pr) <= 2) pt)" 
where a = Rea; a= i,/p if p = 3 (mod 4); and if p = | (mod 49), then « is an 
element of Z[i] of norm p which is congruent to ($) modulo 2 + 2i. 


Z(E,/F,; 1) = (2.7) 
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Before proving the theorem, we note that in the case p = 1 (mod 4) it says 
we choose « = a + bi with a odd (and 5 even), where the sign of a is deter- 
mined by the congruence condition modulo 2 + 23. There are two possible 
choices a + bi and a — bi; and of course the formula (2.7) does not change 
if we replace a by its conjugate. 


Proor. In order to obtain Z(E,/F,; T), we must let the power of p vary, and 
determine N, = #E,(F,r) for p = 1 (mod 4) and N,, = #£E,(F,r) for p = 3 
(mod 4), g = p? (since we know that N, = p’ + 1 for odd r in that case). So we 
fix g equal to p in the first case and equal to p? in the second case (in either 
case g = 1 (mod 4)), and we replace qg by q" throughout the work we did 
earlier to find a formula for #E,(F,), ¢ = 1 (mod 4). 

Because the r is varying, we need a notation to indicate which x, and y4 
we are talking about, i.e., to indicate for which finite field they are multi- 
plicative characters. Let x, ; = ¥, denote the unique nontrivial character of 
F* of order 2, and let x, ; = x4 denote a fixed character of FF of exact order 4 
(there are two, the other one being 7,). Then by composing y, or x, with 
the norm from F,, to F,, we obtain a character of Fj of exact order 2 or 4, 
respectively. We denote these characters y,, and y4,,. For example, if g is 
a generator of F* such that y4(g) = i, and if g, is a generator of Fj whose 
norm is g, i.e., (g,)!*4*'"*” ' =g, then we have 74,,(g,) = i. If N, denotes 
the norm from F,, to F,, we can write our definitions: 


X4.r = X4 ° N,. X27 = X2 ° N,. (2.8) 
With these definitions, using (2.5) and (2.6), we can write: 


#E,(F, r) => qd + 1 — Xn, ra ay, rs 
: ; : (2.9) 
9(h2.)9 ar) 


G(Xa,r) 


where 4), gr = —%2,.(") 


We now use a basic relationship, called the Hasse—Davenport relation, for 
Gauss sums over extensions of finite fields. The Hasse-Davenport formula 
is: 


—gHON,) = (-9@Y- (2.10) 


The proof of this fact will be given in a series of exercises below. Applying 
(2.10) to the three Gauss sums in (2.9), and observing that x2 _,(7”) = x2(n") = 
%2(n)", we conclude the following basic relationship: 


On gt = Og: (2.11) 


The theorem now follows quickly. First suppose p = 1 (mod 4), in which 
case q = p. Then x,(n) is the Legendre symbol (5). Using (2.5) and Lemma 1, 
we find that « = «,, is a Gaussian integer of norm p which is congruent to 
(3) modulo 2 + 27; and, by (2.9) and (2.11), 
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N,=p'+1—a—@, 


This proves the theorem when p = | (mod 4) (see Problem 2 of §II.1). 

Now suppose that p = 3(mod 4), g = p*. Then x,() = 1, since all elements 
of F,, are squares in F,,2. Then Lemma | tells us that «,, , is a Gaussian integer 
of norm q which is congruent to | mod 2 + 2i. Of the four Gaussian integers 
ip, j =, 1, 2, 3, having norm gq, only «,,, = —p satisfies the congruence 
condition. Then, by (2.9) and (2.11), we conclude that for r even we have 


N, = #E,(Far2) = p’ + 1 — (—py? — (— py”. 
Since N, = p’ + 1 for odd r, we have for any r: 
N, =p" +1 —(in/py — (~ivpy. 
This completes the proof of the theorem. oO 


We conclude this section by calling attention to the role Lemma | has 
played in pinning down the reciprocal roots a and & in (2.7). The congruence 
condition in Lemma | will again be needed when we start working with the 
Hasse—Weil L-function of the elliptic curve E,,, which combines the a’s for 
different primes p. In that context, Lemma | is a special case of a general 
fact about how Jacobi sums vary as we vary the prime p. The general case 
is treated in [Weil 1952]. 


PROBLEMS 
1. Prove properties (1)--(3) of Gauss and Jacobi sums that were given in the text. 


2. Let G be a finite group, and let G denote the group of characters ; (i.e., of homo- 
morphisms 7: G > C*). Recall that for any nontrivial 7eG, <6 X(g) = 0. Notice 
that any fixed geéG gives a character g: y++ z(g) on the group G, and also on any 
subgroup SG. Apply these general considerations to the case when G = re 
and S is the subgroup of characters 7 such that 7” = 1. In that way prove the 
relation (2.1) in the text. 


3. Let g = 1 (mod 4), and let 7, have exact order 4. Show that 7,(4) and 7,(—1) are 
both equal to | if g = 1 (mod 8) and equal to —1 if ¢g = 5 (mod 8). Conclude that 
744(—4) = | in all cases. 


4. Show that g(y)? = (—1)"!"?q. It is somewhat harder to determine which square 
root to take to get g(7) (see [Borevich and Shafarevich 1966, pp. 349-353]). 
Compute g(z,) when qg = 3, 5, 7, 9. 


5. For g = 1 (mod 4), again let x, be the nontrivial quadratic character, and let 7,4 
and 7, be the two characters of exact order 4. Compute J(z, 74) and J(72, %4) 
directly from the definition when q = 5, 9, 13, 17. 


6. Show that if x2 is the nontrivial quadratic character of F* and x is any nontrivial 
character, then J(7,, 7) = x(4)J(z vO. 


7. Let y3 and 7, be the two characters of Fj of order 3, where g = | (mod 3). Compute 
J(%3, %3) and J(Z3, Z%3) directly from the definition when g = 7, 13. 
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8. (a) Notice that we proved that the number N, of F,r-points on E,, is independent 
of n if r is even. Show this directly. 
(b) Also notice that N, does not change if n is multiplied by an integer which is 
a square in F,. This is for the same reason that we could, without loss of gen- 
erality, reduce to squarefree » when considering @-points. Namely, if K is 
any field not of characteristic 2 and if m, ne K*, construct a simple corre- 
spondence between £,(K) and E,,,,2(K). 


9. This problem concerns a more general definition of Gauss sums, examples of 
which will occur later in the chapter. Let R be the ring of integers in a number field 
K, and let J be a nonzero ideal of R. Then R// is a finite ring. Let wy: R/J—> C* be 
an additive character which is nontrivial on any additive subgroup of R// of the 
form J// for any strictly larger ideal J > J (including the “improper ideal” J = R, 
which will be the only such J/ if / is a prime ideal). Define the norm NJ = #(R//). 
Let x: (R/1)* > C* be any multiplicative character. Take y(x) = 0 for xe R/J not 
prime to /. Define g(y) =9(% ) Er Lv), where the summation is over 
xER/I. 

(a) Prove that X y(x)W(ax) = x(a)g(z, W) for any ae (R/1)*. 

In parts (b) and (c) we suppose that x is “‘primitive’’ modulo /. By definition, this 
means that, for any strictly larger ideal J > J, x is nontrivial on the subgroup of 
(R/J)* consisting of elements congruent to | modulo J. 

(b) If x is primitive, show that the formula in part (a) holds for all ae R/J. 

(c) For x primitive, prove that g(x. Wg(% W) = x(— NJ, and |g(z, )| = /NV. 
Some examples of the characters and Gauss sums in this problem are: (1) if Jis a 
prime ideal with residue field F,, then property (2) of Gauss sums in the text is a 
special case of part (c); (2) if R= Z and / is the ideal (NV), then x is an ordinary 
Dirichlet character. NJ = N, we often take w(x) = e?"*/", and “primitive” means 
that the value of x(x) for xe(Z/N Z)* does not depend only on its residue modulo 
some proper divisor of N ; (3) later in the chapter we will encounter examples where 
R=Z{[i]. 


Problems 10—17 will lead to a proof of the Hasse—Davenport relation. 


10. Let S be the set of all monic polynomials in F,[x], and let S'” denote the subset of 
all irreducible monic polynomials. Subscripts will indicate degree. By writing 
xY x= THz cr,,(X — #), prove that x” — x = TIf, where the product is over all 
fin Si for all d dividing r. 


11. Let y be a nontrivial additive character and y a multiplicative character of F,. 
If fe S is written in the form f(x) = x? —c,x*1 +--+ + (—1)*c,, define a map 
A: SC by A(f) = x(cgW(c,). Uf f= 1 is the constant function in Sp, then define 
2(1) = 1.) Prove that A({,, fa) = 4041) 4(f2) fori, eS. 


12. Prove that the Gauss sum can be written g(z%) = Zyes, A(/)- 


13. Suppose that «el, satisfies monic irreducible polynomial fe Sj", where d|r. 
Then show that 1(/)’" = y,(a) - w,(a), where the subscripts here indicate the charac- 
ters of F,r obtained by composing with the norm from F,, to F, (in the case of a 
multiplicative character) or with the trace from F,, to F, (in the case of an additive 
character). 


14. Prove that 9(7,) = Lay Lye sir aD SY". 


§2. 


15. 
16. 
17. 


18. 


19. 


20. 


21. 
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Prove the power series identity Lp.sA(f)T%*! = Mypesier (1 ~ AC f)T**/). 
Show that if d > 1, then LresyAS) =0. 
Taking the logarithmic derivative of both sides in Problem 15, prove that 


(“ly 'gar=d Yan’ 


dlr fe sitr 
and conclude the proof of the Hasse—Davenport relation. 


(a) Show that the ideal (2) in Z[i] is the square of the prime ideal (1 + i); and 
that any element «¢Z[/] not in (1 +) has a unique associate i/w which is 
congruent to 1 modulo (1 + {)? = (2 + 2i). 

(b) Show that the ideal (3) in Z[w], w = (—1 + /—3)/2, is the square of the 
prime ideal (,/— 3); and that any element «¢Z[«] not in (,/—3) has a unique 
associate (—w)4a which is congruent to 1 modulo 3. 


Consider the elliptic curve y? = x* — a, aeF¥. Recall from Problem 4 of §1.9 
that it has g + 1 points if g = 2 (mod 3). So suppose that g = 1 (mod 3). Let x, 
be the nontrivial quadratic character of Fy, and let ~, be either of the nontrivial 
characters of F¥ of order 3. Prove that the number of F,-points on the elliptic curve 
is equal to 


g+1+ %2(—a)(X3@J (Xa, X3) + X3(@I (x2, %3))- 


Let g = | (mod 3), and let x, be a nontrivial character of Fy of order 3. 

(a) Prove that g/(73, %3) = 9(x3)°- 

(b) Prove that J(x3, 3) = —1 (mod 3) in Z[w], where w = (—1 + i,/3)/2. 

(c) Show that J(y3, 73) =p if q =p’, p = 2 (mod 3). 

(d) Suppose that g = p= 1 (mod 3). Choose a+ bw so that p =|a+ bal? = 
a? — ab + b?. Show that exactly one of the two ideals (a+ bw), (a + b@) 
(without loss of generality, suppose the first one) has the property that 


¥3(x) =x" (mod a+ bw) forall xeF,. 


(e) Let g = p = 1 (mod 3), and choose a + bw as in part (d). Show that —J(73, 73) 
is the unique element generating the ideal (a + bw) which is congruent to 1 
modulo 3. 


Let N, be the number of F,,-points on the elliptic curve y* = x? — a, where aeF*, 

p #2, 3. 

(a) If p = 1 (mod 3), let x, and x, be nontrivial characters of F¥ of order 2 and 3, 
respectively, and set « = —y,(—a)y3(4a)J(73, 73). Prove that NV, = p’+1— 
a” — a. 

(b) If p = 2 (mod 3), let ~, and x; be nontrivial characters of Fy: of order 2 and 3, 
respectively. First prove that y, and x, are both trivial on elements of FF. 
Now set « = i,/p. Prove that N. = p"+ 1 — a" — @’. 

(c) Conclude that in both cases the zeta-function is (1 — 2c7 + pT’)/( — T)(1 — 
pT), where c= 0 if p = 2 (mod 3), and c = —7,(—a)Re(73(4a)J(43, x3)) if 
Pp = 1 (mod 3). 


Let C < P2 be the curve y? + ay = x, aeK (ie., F(x, y, z) = y?z + ayz? — x). 


(a) Find conditions on the characteristic of K and on ae K which are equivalent 
to C being smooth at all of its K7*“'-points. 
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(b) Let K = Fr. Show that for r odd, #C(F,r) = 2” + 1 (this is independent of a). 
(c) Let K = Fyr, ae K, a #0. Let x, be a nontrivial character of K* of order 3, 
and let 7; be the other one. Derive the formula: 


#C(Far) = 4 +14 73(@I (Xs, X3) + 12(@I (Xs, Xa)- 
(d) In the situation of part (c), show that J(z3, 73) = (— 1)""!2”; then find a formula 
for Z(C/F,; T) whena = 1. 
(e) Now let K = @, a= 1. Find a linear change of variables (with coefficients 
in Q) which transforms C to the elliptic curve y? = x? + 16. 


23. Let N, be the number of F,,-points on the elliptic curve E,: y? = x* — n’x, where 
pf 2n. 

(a) Show that if p = 3 (mod 4), then N, is independent of n; it equals p’ + 1 if 
r is odd; and it equals (p"? — (—1)")? if r is even. 

(b) Now let p = 1 (mod 4). In Problem 8 above, we saw that N, is independent 
of n if r is even, and if r is odd it depends only on whether v is a quadratic 
residue or nonresidue modulo p. For odd r, let N{** and N,* denote the N, 
for na residue and for » a nonresidue, respectively. Show that N,, is a multiple 
of the least common multiple of N° and N,". 

(c) For p=5, make a table of N/* and N," for r= 1, 3, 5, 7 and a table of N, 
for r= 2, 4, 6, 8, 10, 12, 14. In each case, determine the type of the abelian 
group E,(F,r). (See Problems 9 and 11 in 81.9.) 

(d) For p = 13, make a table of N;/** and N," for r = 1, 3, 5 and a table of N, for 
r = 2, 4, 6, 8, 10; and in each case, find the type of E,,(F,,). 


§3. Varying the prime p 


In this section we look at the elliptic curve E,: y? = x* — n*x and its zeta- 
function Z(E,/F,; 7) as p varies. We shall later want to combine these 
zeta-functions for the various p into a single function, called the Hasse—Weil 
L-series of the elliptic curve. It is the Hasse—Weil L-function that is intimately 
related to the group of Q-points on £,,. 

The denominator of Z(E,/F,; 7) is always (1 — T)(1 — pT). Only the 
numerator depends on p. If p|2n, in which case E, is not even an elliptic 
curve, the numerator is simply | (see Problem 10 in §II.1). Otherwise, the 
numerator is a quadratic polynomial in T of the form (1 — a7)(1 — @7). 

When we later define the Hasse—Weil L-series of E,,, we shall take this 
quadratic polynomial and replace T by p* (s is a new complex variable). 
The resulting expression (1 — ap~°)(1 — ap *) is called the “Euler factor at 
p”’, by analogy with the term in the Euler product expansion of the Riemann 
zeta-function: 


(= y nis |] (where Re s > 1). 3.1) 


primes p 


l-—p* 


In this section we shall study how this “Euler factor” depends on p. This 
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dependence will turn out to be described by a certain character x, of Z[i] 
(see Problem 9 of the last section). 

For the duration of this section we shall let P denote prime ideals of the 
Gaussian integer ring Z[i]. There are two types: (1) P=(p) for p=3 
(mod 4); (2) P= (a+ bi) for a? + b*? = p = 1 (mod 4). In the latter case 
we have PP = (p), and we say that p ‘splits’ in Z[i]. (There is also the 
special case P = (1 + i), which ‘“ramifies’’, i.e., P? = (2).) The degree of a 
prime P dividing (p) is defined to be the degree of the field extension Z[i]/P 
of F,,; it is 2 in the first case and | ifp splits. We can then rephrase the theorem 
in the last section as follows. 


Proposition 1. 
(1 — T)(1 — pT)Z(E,/E,; T) = T] (1 — (@pT)**”), (3.2) 


P\(p) 


where the product is over the (one or two) prime ideals of Z[i] dividing (p), 
and where ap = ix/p if P= (p) and ap = a+ biif p splits, where a + bi is the 
unique generator of P which is congruent to (8) modulo 2 + 2i. We take ap = 0 


if P\(2n). 


We now define a map Z, on Z[i] which will be multiplicative and will 
satisfy 7,(x) = a$°8” for any generator x of P = (x). This multiplicative map 
is of the form ¥,(x) = xy;,(x), where y,,(x) has value 0, +1, or +7. First of all, 
we define x;(x) = 0 if x has a common factor with 2n. Next, for n= 1 we 
define 7{(x) to equal i/, where // is the unique power of i such that x = 1 
(mod 2 + 27). Here x is assumed prime to 2, and hence an element of 
(Z[i]/(2 + 27))*, which has four elements represented by the powers of i. 
Finally, for other n and for x €Z[i] prime to 2, we define ¥,(x) = 71 (*) (RR), 
where Nx = xX is a positive odd integer, and (m) 1s the Legendre symbol 
(which extends from prime modulus (;) to arbitrary positive odd modulus 
by requiring that (sm;) = Gm,) Ga;)). To summarize, we have defined: 


14 (x) (4) for x prime to 2n; 
HnlX) = XX(X)S — Xn(X) = ; * (3.3) 


otherwise; 
where for x prime to 2 
xyi(x) =i with i/x=1 (mod 2 4 27). (3.4) 


Suppose that x generates a prime ideal P = (x) not dividing 2n. If P = (p) 
with p = 3 (mod 4), then (qj) = Gz) = I, and Z,(x) = ix = —p. That is, Z, 
takes any of the four possible generators of P to «3. If x is any of the four 
possible generators of a prime P of norm p=1 (mod 4), then ¥,(x) = 
/x(%) = (8) (mod 2 + 2i), ie., X%,(x) is the unique generator ap which is 
congruent to (;) modulo 2 + 27. We have thus shown: 
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Proposition 2. The map ¥,, defined in (3.3)—(3.4) is the unique multiplicative 


map on Z[i | which coincides with «8? on any generator of a prime ideal P. 


Notice that y; is a character on (Z[i]/(2 + 27))*. It takes any x to the 
root of unity in the class 1/x. The general y;, is obtained from yj using the 
Legendre symbol, with the variable x appearing on the bottom. We now 
use quadratic reciprocity to bring the variable x up on top, thereby showing 
that y;, is a character. At this point recall Problem 9 of the last section, in 
particular, the definition of a “primitive” character on a number ring. 


Proposition 3. The map y,, defined in (3.3)—(3.4) is a primitive multiplicative 
character modulo (2 + 2i)n for odd n and modulo 2n for even n. 


ProorF. Suppose x is prime to 2n. Let n = 2°/, --- /,, where the /; are distinct 
odd prime numbers, and ¢ = 0 or 1. Note that Nx is a product of odd prime 


powers, where the primes p,, ..., p, occurring to odd powers are all con- 
gruent to | (mod 4). First, it is easy to see that 
1 if Nx = 1 (mod 8); 
2). ; . ( ) (3.5) 
Nx —1 if Nx = 5 (mod 8). 


Next, we compute that 


re) los Gl 


by quadratic reciprocity, since p, = 1 (mod 4). Since Nx is equal to an odd 
square factor times the product of the p,, we conclude that 


wo = 000 (2) TI (“)-no(Z)(E) 69 


where ny = nif n is odd, ny = n/2 if n is even. 

We now prove the proposition in the case n odd. The proof for n even is 
very similar, and will be left as an exercise below. 

We must first show that 7;,(x) depends only on what x is modulo (2 + 2/)n. 
Suppose that x’ = x + (2+ 2i)nB. Since x’ =x (mod 2 + 27), we clearly 
have x} (x’) = x; (x). Next, we have 


Nx = (x + (2 + 2i)nB)(X + (2 — 2i)nB) = x-X=Nx (mod n), 


and hence the Legendre symbols are also equal. (This would not have been 
clear until we used quadratic reciprocity to bring Nx to the top, obtaining 
(S*) in (3.6).) 

To show primitivity, we must show that there is no proper divisor of 
(2 + 2i)n such that x;(x) depends only on what x is modulo that proper 
divisor. Thus, if x, were not primitive mod (2 + 2i)n, there would exist a 
prime ideal Q dividing (2 + 27)n such that y;(x) depends only on x modulo 
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the ideal ((2 + 2i)n)/Q. In particular, y/(x) 4 —1 for all x = 1mod((2 + 
2i)n)/Q. We consider three cases, and show that each leads to a contradic- 
tion. 


(i) O=(1 +3), ie, x(x) # —1 for all x = 1+ 2nB, BeEZ[i]. But since 
Nx = 1 (mod n), we have 7/,(x) = x)(x) = 71 (1 + 26), and this value is 
—1 if, for example, B = i. 

(ii) O = (a+ bi) with (a + bi)(a — bi) = 1 = 1 (mod 4), /|n. Then we are 
supposing that y/(x) # —1 for all x of the form 1 + B(2 + 2i)n(a — bi)/l, 
where BeZ[i]. Let B = k(1 — i), where k is an arbitrary integer, i.e., 
x= 14 4kn(a — bi)//. Then x(x) = 1, and Nx = 1 + 8akn// (mod n). 
Hence, 7/(x) = (4*8#“). Since 8an// is prime to /, it follows that 
1 + 8akn// runs through all residues modulo / as k varies. In particular, 
there is a value of k for which 1 + 8akn// is a quadratic nonresidue, 
i.e., ¥,(x) = —1, a contradiction. 

(iii) O = (J with / = 3 (mod 4). Then we are supposing that y;,(x) # —1 for x 
= 1 (mod (2 + 2i)n//). Since x = 1 (mod 2 + 2i), we have x} (x) = 1, and 
so 74(x) = (82) = (&), since Nx = 1 (mod n//). Now since (2 + 2i)n/I 
is prime to /, it follows by the Chinese remainder theorem that x of the 
form | + B(2 + 2i)n// runs through all residues of Z[i] modulo Q. 
If we consider x modulo Q, i.e., as an element in the field Z[i]/Q ~ F,2, 
then the norm map x-> Nx = x-X is simply the norm map from F,2 
to F,. And the latter map is surjective (for instance, a generator g, of 
Fi goes to a generator g = g3"' of F*). Hence, there are x of the required 

form for which y/(x) = (4%) = —1. This concludes the proof of the 

proposition. Oo 


For the remainder of this chapter, we shall let n’ denote the conductor of 
x,, i.e., a generator of the largest ideal such that y;,(x) depends only on x 
modulo that ideal. By Proposition 3, we may choose 


; oo n odd; 
n= 


3.7 
2n, n even. Oe) 


Whenever one studies transformation formulas for functions involving 
characters, as we shall do in the sections that follow, the Gauss sum of the 
character is almost certain to make an appearance. In preparation for our 
later derivation of the functional equation for the Hasse— Weil L-series of E,,, 
we now find a formula for the Gauss sum of the character 7: (Z[i]/n’)*» C* 
(whose image consists of powers of i). 

We define our additive character on Z[i ]/n’ by the rule: 


w(x) = e2ni Re(xjn’) (3.8) 


It is easy to check that W is a nontrivial additive character of Z[i]/n’ which 
satisfies the condition in Problem 9 of the last section, namely, it is nontrivial 
on the multiples of any proper divisor of 7’. 
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Proposition 4. 
=) n’, nodd 
In) GD Halxyerm Ren) = (3.9) 


eee =) 
xeZ[i}/n in’, n= 2no even. 
No 


Proor. To show that g(z) = 2 + 2i and g(y) = 47 is a short computation 
that will be left as an exercise (Problem 2 below). 

Let m be a positive squarefree odd number. Let (@) denote the character 
xt (3) on (Z[i]/m)*. Then by (3.6) we have 


tn = 4 (;) for n odd; Xn = 4 (x) forn =2n, even. (3.10) 
0 
We define the Gauss sum for the character («) as follows: 


Nx ni Re(x/m 
(a) a FG 
xeZf[i]/m 


We can obtain an alternate form for g((m)) if we replace x by 2x. (Note that 
2x runs through (Z [i ]/m)* as x runs through (Z[i]/m)*.) Since N(2x) = 4Nx, 
we have (82%) = (82). Writing Re(2x/m) as 4 Tr x, where Tr x denotes x + x, 


we have 
Nx (2ni/m) Tr x 
—}) = —— le ; (3.12) 
: ((r)) a ( my ) 


Proposition 4 will follow as an immediate consequence of the following 
lemmas, which will be proved below. 


Lemma 1. 


(; ;)): n odd; 
I(Xn) = 
rs some ( .\). n = 2no even. 


mm mm l(a) =( (a) () 


Lemma 3. /f p is an odd prime, then g ((;)) =p. 


PROOF OF LEMMA |. First suppose that n is odd. Write x in the form x = 
(2 + 2i)x, +x, where x, runs through a set of representatives of Z[i] 
modulo n and x, runs through a set of representatives of Z[i] modulo 
2 + 2i. By the Chinese remainder theorem, x then runs through a set of 
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representatives of Z[i] modulo (2 + 2i)n. By (3.10), we have y,(x) = 
14 (nx) (NCH20*)), By (3.4), we have x} (n) = Gi). Also, N((2 + 2i)x,) = 
8Nx,, and so the second term becomes (2%**). Meanwhile, in the additive 
character we have Re(x/n’) = Re(x,/n + x,/(2 + 2i)). Hence, in the defini- 
tion (3.9) of g(x;,) we have 


a me 2 , Nx, 2Qni Re(x,/n)+ 2ni Re(x>/(2+2i)) 
atx ~( = )(7) » | = Je : 


x,eZ[i}jn 
x) eZ [i]/(2+ 21) 


and the double sum on the right separates out into g(y1)9(@). 
The proof for even n is very similar, where we write x = 4x, + 9X3. 
The details will be left as an exercise. oO 


PROOF OF LEMMA 2. The proof is quite similar to that of Lemma 1. In the 
definition (3.11) we write x = x,m, + X,m,, where x; runs through a set of 
representatives of Z[i]/m;, j= 1, 2. Since Nx =m3Nx, (mod m,) and 
Nx = mZNx, (mod m,), we have 


Nx\  /Nx\(Nx\_ /Nx,)\ (Nx, 
mj) \m,)\m,)~ \m,)\m, ) 
Since also Re(x/m) = Re(x,/m,) + Re(x,/m,) the sum in (3.11) separates 


out into a product over x, which is equal to g((m#,)) and a product over x, 
which is equal to g((m)). Qa 


PROOF OF LEMMA 3. We first consider the case p = 1 (mod 4). Let p = BB, 
where B = a + bi. In (3.11), we write x = x,B + x,B, where x, and x, each 
run through 0, 1, 2, ..., p — 1 (note that these numbers are representatives 
of Z[i]/B and also of Z[i]/B). Again, since B and are relatively prime, the 
Chinese remainder theorem tells us that x will run over Z[i]/p. We have 
Nx = (x,B + x,B)(x,B + x,B) = 2x,x, Re B? (mod p). But Re f? = a? — 
b? = 2a’ (mod p), since p = a” + b. Thus, since Re x = x,a + x a, we have 


by (3.11) 
=, ax aX (2ni/p)(ax ,+ax5) 
g ae _ Saree A a 
((;)) a L ) 


The double sum separates out into the square of a single sum over x, €Z/pZ. 
If we then replace ax, by x, we obtain 


(O)-,0e"9 


which we know equals p by property (2) of Gauss sums for finite fields (see 
§II.2; also see Problem 4 of §I.2). 

Finally, suppose that p = 3 (mod 4). Then (p) is a prime ideal of Z[i], 
and Z[i]/p is the field of p? elements. In that case, g((>)) in (3.12) is the 
Gauss sum for the multiplicative and additive characters of F,2 obtained 
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from the multiplicative character ($) and additive character e?"? of F, 
using the norm and the trace. In other words, we are in the situation of the 
Hasse—Davenport relation (2.10), which tells us that —g((>)) is the square 
of the Gauss sum 2,.;_ (#)e?"*/”. Again using Problem 4 of §II.2 (this time 
with g = p = 3 (mod 4)), we conclude that g((s)) = p. 

This completes the proof of the lemmas, and hence of Proposition 4. oO 


In Proposition 4, the term (>) for n odd, (72) for n even, is equal to +1 
if nm = 1, 2, 3 (mod 8) and is equal to —1 if n = 5, 6, 7 (mod 8). This sign 
will turn out to play a crucial role in the functional equation for the Hasse— 
Weil L-series for E,. It is called the ‘‘root number’’. If it equals —1, then 
conjecturally it follows that m must be a congruent number. But there is no 
known direct reason why any squarefree n congruent to 5, 6, or 7 modulo 8 
should be the area of a rational right triangle. 


PROBLEMS 

1. Using (3.6), prove Proposition 3 for nm = 2no even. 

2. Verify the formula in Proposition 4 for n = 1, 2 by a direct computation. 
3. Prove Lemma | for even n. 


4. Give another proof of Lemma 3 directly from the definition of g((;)). 


§4. The prototype: the Riemann zeta-function 


For Re s > 1, the Riemann zeta-function is defined by the convergent infinite 
sum of reciprocal s-th powers, or alternatively by the product of “Euler 
factors” 1/(1 — p-*) with the product over all primes p (see (3.1)). In this 
section we give a proof of analytic continuation and the functional equation 
for the Riemann zeta-function ¢(s). The proof has all of the essential elements 
that will later be needed to prove analogous facts about the Hasse—Weil 
L-function of E,. 

We start by recalling some basic tools for working with real- and complex- 
valued functions. First, we summarize the properties of the gamma-function 
(for the proofs and further details, see, for example, [Whittaker and Watson 
1958, Chapter XII], or [Artin 1964]). 

The gamma-function I"(s) interpolates n! in the sense that (7) = (7 — 1)!. 
It can be defined for seC with Re s > 0 by the integral 


roa | entgse, (4.1) 
¢€ 6 t 
It satisfies the relation 


T(s + 1) =sT(s), (4.2) 
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which enables one to continue I'(s) analytically onto all of the complex 
s-plane, except that it has simple poles at s=0, —1, —2, —3, .... The 
gamma-function also satisfies the relations 


Tu 


T(s)rd —s) = aCe (4.3) 
and 
r(5)r (: ss ' = /n2)-T(s). (4.4) 


Finally, using (4.2) and (4.3), one easily sees that the reciprocal of the gamma- 
function is an entire function of s. 

The gamma-function (4.1) is a special case of a construction known as the 
“Mellin transform’. Given a function f(t) on the positive real axis, its 
Mellin transform is the function g(s) defined by the formula 


a= | sort (4.5) 
0 


for values of s for which the integral converges. Thus, I'(s) is the Mellin 
transform of e~'. Notice that for any constant c > 0, the Mellin transform 
of e “isc *T(s): 


[e eel =¢T@); (4.6) 


as we see after a simple change of variables. We shall often have occasion 
to use (4.6). 

Another tool we shall need is the Fourier transform. Let ¥ be the vector 
space of infinitely differentiable functions f: Rt C which decrease at infinity 
faster than any negative power function, i.e., wis) —0as x—> +0 for 
all N. An example of such a function is f(x) = e~**’. For any fe we define 
its Fourier transform f by: 


$N% i eo 2m F(x)dx. (4.7) 


It is not hard to show that the integral converges for all y, and fe Y. 
The following properties of the Fourier transform are also easy to verify: 


(1) If aeR and g(x) = f(x + a), then g(y) = e?"f(y). 
(2) If aeR and g(x) = e*"f(x), then g(y) = f(y — a). 
(3) If b > 0 and g(x) = f(bx), then g(y) = 4 /(9/0). 


For example, to check (3), we compute 


oy =|" em 7bxyas = [7 omemreo $F = 5 fom 
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Proposition 5. If f(x) = e~*”’, then f =f. 


Proor. Differentiating under the integral sign, we have 


4 _ da <2] 
fo=% | 


co 


eA Yds ~2ni | e728 QB dy, 
Integrating by parts gives 
fy) = —2mni e7 27xY 1 eum 
2n 


4 rai | —2niy e~2t!*ve __dy 
—2n 


—oO 


= —2ny | e 29f(x)dx = —2ny f(y). 
Thus, / satisfies the differential equation f’(y)/f(y) = —2ny; this clearly has 
solution f(y) = Ce~™”, where C is obtained by setting y = 0: 


C=f(0) = ie ede 1, 


— oO 


(Recall the evaluation of the latter integral: 


C= | ewde| e dy = | e+ dxdy 
— 0 —2 R? 


= \ e7™” Onr dr = | e “du = 1.) 
ie) 10) 
Thus, f(y) = e~™, as claimed. Oo 


Proposition 6 (Poisson Summation Formula). [fge/, then 
XY gm= YL Gm). (4.8) 


Proor. Define h(x) = L?__,,g(x +k). The function h(x) is periodic with 
period 1, and has Fourier series h(x) = L®__ , Cne?”""*, where 


1 1 ow oo) 
Coe | h(xye7""""dx = | Y gixt ke 2 dx = | g(xje 27" dx, 
0 — 00 


0 k=—0 
where we interchanged summation and integration, and made a change of 
variables (replacing x +k by x) to obtain the last equality. But the last 
expression is simply g(m). Now the left side of (4.8) is h(0), by definition ; 
and the right side is also A(0), as we see by substituting x = 0 in the Fourier 
series for h(x) and using the fact that c,, = g(m). im 


We now define the theta-function: 


a= Yet for 1>0. (4.9) 


n=—0 
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Proposition 7. The theta-function satisfies the functional equation 


6(t) = 01 it). (4.10) 


Ji 


Proor. We apply Poisson summation to g(x) = e-™” for fixed t> 0. We 
write g(x) = fG/tx) with f(x) = e~™”. By Proposition 5 and property (3) 
of the Fourier transform (with b = \/t) we have g(y) = 171?e"™"". Then 
the left side of (4.8) is @(f), and the right side is t~70(1/1). This proves the 
proposition. oO 


We sometimes want to consider 6(t) for complex t, where we assume that 
Re t > 0 in the definition (4.9). The functional equation (4.10) still holds 
for complex ¢, by the principle of analytic continuation of identities. That is, 
both sides of (4.10) are analytic functions of ¢ on the right half-plane. Since 
they agree on the positive real axis, they must be equal everywhere for 
Ret>0. 


Proposition 8. As t approaches zero from above, we have 
a(t) — 1? <e (4.11) 
for some positive constant C. 
Proor. By (4.10) and (4.9), the left side is equal to 21-1? X*, e~™””". Suppose 
t is small enough so that ,/t > 4e~ and also e~3"/' < 4. Then 
|) — 1? | <gett(e™ + ett oc ge Mt e+ atetoc) 
= eo (nA. 

Thus, we can take C=2— 1. ia 

We now relate 6(t) to the Riemann zeta-function. Roughly speaking, C(s) 
is the Mellin transform of @(f). The functional equation for 0(¢) then leads 


us to the functional equation for ¢(s), and at the same time gives analytic 
continuation of f(s). We now show how this works. 


Theorem. The Riemann zeta-function C(s) defined by (3.1) for Re s > 1 extends 
analytically onto the whole complex s-plane, except for a simple pole at s = | 
with residue 1. Let 


on S 
A(s) =a eo (5) C(s). (4.12) 
Then A(s) is invariant under replacing s by | — s: 
A(s) = AC — s). 


That is, €(s) satisfies the functional equation 
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mer(3 5)s = = ql ene *)ea — 5). (4.13) 


Proor. Basically, what we want to do is consider the Mellin transform 
J? a(t4. However, for large ¢ the theta-function is asymptotic to 1 
(since all except the 7 = 0 term in (4.9) decrease rapidly); and for ¢ near 0 
it looks like t-"?, by Proposition 8. Hence, we must introduce correction 
terms if we want convergence at both ends. In addition, we replace s by $ 
(otherwise, we would end up with ¢(2s)). So we define 


foo) 1 
o(s) = t?(6(t) — ees + | 4? (0% — a\¢. (4.14) 
“ 1 0 Jt f 
In the first integral, the expression 0(t) — 1 = 222, e~*"' approaches 


zero rapidly at infinity. So the integral converges, and can be evaluated 
term by term, for any s. Similarly, Proposition 8 implies that the second 
integral converges for any s. In any case, since 0(t) is bounded by a constant 
times ¢-? in the interval (0, 1], if we take s with Re s > 1 we can evaluate 
the second integral as 


1 1 1 
ij p2gr) Ht an | ps1 at at | pana = ee 
6 t 3 t ‘ t s—]l 


Thus, for s in the half-plane Re s > 1, we obtain: 
00 ioe) 1 oo f 
#(s) =? >: onnntys/2 Al 4 pirat 42 Y onrntys/2 At ea, 2 
rah t et el te s=4 
dt 2 2 


Sak 2 yi em mend ar + 
n=1 Jo 1 _ x 


Using (4.6) with c replaced by mn? and s replaced by o we have: 


51) = z cany-r (5) +242 


—s 
(4.15) 
1 


s 


=«~r($ Joo +3 +7 


where always here Re s > 1. 

Now @(s) is an entire function of s, since the integrals in (4.13) converge 
so well for any s, as we saw. Thus, (4.14) shows us that there is a meromorphic 
function of s on the whole complex plane, namely 


ni | 
a3 (9) arene 


which is equal to ¢(s) for Re s > 1. Moreover, since 2°”, 1/4), and $(s) 
are all entire functions, it follows that the only possible poles are at s = 0 
and at s=1. But near s = 0 we can replace sI'(S) in the denominator by 
2($)G) = 21 + 1), which remains nonzero as s > 0. Hence the only pole 
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is at s = 1, where we compute the residue 


ni? (1 1 1 \. We | 
as Dam (500) aa ae 7 “Tap! 


It remains to prove the functional equation. Since, by (4.15), A(s) = 
3$(s) — ¥ — qty, and since § + q+» is invariant under replacing s by 1 — s, 
it suffices to prove that (s) = @(1 — 5). This is where we use the functional 
equation (4.10) for the theta-function. Using (4.10) and replacing ¢ by + in 
(4.14), we obtain (note that d(+)/(+) = —%, and |? becomes J? = —J§ under 
the substitution): 


avn) Peel) 


(reptacing t by *) 


eer dt a —s/2 dt 
=] (Jy -)— + | POO -—VO— (by 4.10)) 
1 


0 


Wace i Sie eee mes 
-| pi-9/2 (a -~)#+ [ PAD) — Y= 


= o(1-5). 


This completes the proof of the theorem. oO 


In a similar way one can prove analytic continuation and a functional 
equation for the more general series obtained by inserting a Dirichlet 
character y(n) before n~* in (3.1), or, equivalently, inserting y(p) before p* 
in the Euler product (see Problem | below). That is, for any character 
x: (Z/NZ)* > C*, one defines: 


Ly y= ¥ OO _ where Res>1). (4.1 

oe 2 n lor ieekena ty: 
The details of the proof of analytic continuation and the functional equation 
will be outlined in the form of problems below. 

The Hasse—Weil L-function for our elliptic curve E,,, to be defined in the 
next section, will also turn out to be a series similar to (4.16), except that 
the summation will be over Gaussian integers x, the denominator will be 
the norm of x to the s-th power, and the numerator will be 7,(x), where {,, 
was defined in (3.3) in the last section. The techniques used in this section 
to treat the Riemann zeta-function can be modified to give analogous 
facts—analytic continuation and a functional equation—for the Hasse—Weil 
L-function for E,. In the final section we shall use this information to 
investigate the “‘critical value’’ of the Hasse—Weil L-function, which is 
related to the congruent number problem. 
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PROBLEMS 


1. (a) Show that the summation form and the Euler product form of the definition of 
L(y, Ss) are equal. 
(b) Prove that if y is nontrivial, then the sum in (4.15) actually converges (condi- 
tionally) for Re s > 0. 


2. Let G = Z/NZ, and let € = e?7!. 
(a) Define the “finite group Fourier transform” of a function f: G > C by setting 
S(@) = Xyeg h(E for aeG. Prove that f(b) = FZ, -¢ flaye”. 
(b) For fixed seC with Re s > 1, let {: GC be the function 
f(b) = ee ee 
n>1,n=b(mod N) 
Prove that for any primitive Dirichlet character y modulo N and for any s with 
Res>1: 
1 2 a Gi 
LY, s) =— —— 
(8) = 3,90) 2 x(a) 2 a 
where g(x) is the Gauss sum (see Problem 9 of §II.2). 
(c) Take the limit in part (b) as s approaches 1 from above, supposing y nontrivial. 
In that way derive a simple formula for L(y, 1). 
(d) Define the “dilogarithm” function by (x) = £2, = for |x| < 1. Express L(x, 2) 
in terms of the dilogarithm. 


3. (a) For fixed f > 0 and aeR, what is the Fourier transform of e7~™*+® ? 
(b) Suppose that ae R is in the open interval (0, 1). Define the following functions: 


{as)= Vata’, Res>1; 
n=0 

l(a, 5) = Y n Se2nina, Res> 1; 
n=1 


foe) 
6,(t)= ¥ evmrta? tO; 


n= — 0 


0°(t) = y e2nina nin? t>0. 
(The notation /(a, s) should not be confused with the function /(x) in Problem 2.) 
Prove that 
(i) 0,(¢) = 0-470°(4); 
(ii) |0,(2) — t-¥?| < e~* as t +0 for some positive constant C, ; 
(iii) |8°(2)| < e~ ©" as 1 > 0 for some positive constant C). 
(c) Prove that (a, s) + €(1 — a, s) as a function of seC extends to a meromorphic 
function with no pole except for a simple pole at s = 1; that the function /(a, s) + 
/(1 — a, s) extends to an entire function; and that 


xar(s) (C(a, s) + (1 — a, 8)) 


= sl a) ((a, 1 —s) +10 —a,1—s)). 
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(d) Let xy be a primitive Dirichlet character mod N. Let L(y, s) be defined as in (4.16). 
Prove that for Res > 1: 


@ y n00ne( Fs) = NL ); 


O<h<N 


Gi) sdtol(F.5) = a00La, 9. 
O<h<N 

(e) Suppose that x is a nontrivial even character, i.e., y(— 1) = 1. Prove that L(y, s) 
extends to an entire function of seC, and find a functional equation relating 
Ly, 8) to L(Z, 1 — s). 

(f) Let x be a primitive even quadratic character, i.e., y(n) = +1. Recall from 
Problem 9(c) of §II.2 that g(x)” = N, so that g(x) = +./N. Suppose you some- 
how knew that L(y, 3) # 0. Show that this implies that g(x) = //N. 

(g) With x as in part (e), show that L(x, s) = Oif sis an even negative integer or zero. 

(h) With x as in part (e), express L’(y, — 2k) in terms of L(¥, 2k + 1). In particular, 
express L’(z, 0) in terms of L(x, 1). 


4. Let x be a nontrivial even primitive Dirichlet character mod N, and define 


0% 0= Y xine =; Y xem", = t > 0. 
n=1 


neZ 


(a) Prove that 


N 
(i) O(, 0 = 5 Y agn(N22): 
a=1 
1 N 
(i) 5 0%) = OG, 0: 
a=1 


ws 90) ges iiN2 
(ii) O(y, t) = = 0, 1/N 70). 
) 0% 9 UNG x ) 
(b) Show that the Mellin transform of 6(x, t) converges for any s (with no need for 
any correction term), and that for Re s > } it is equal to 2 ‘“T'(s) L(y, 2s). 
(c) Use the functional equation in part (a) (iii) to give another proof of the functional 
equation for L(x, s) in Problem 3(e) above. 


5. (a) Let fe Y, and g(x) = f’(x). Show that g(y) = 2niyf(y). 
(b) Find the Fourier transform of (x + a)e~™**”, with ¢ and a as in Problem 3(a). 
(c) Let ae (0, 1). Define C(a, s) and /(a, s) as in Problem 3 above, but now define 6, 
and 0° differently: 


6(t)= Y (ntayer™mro, t>0; 


n=—o@ 


co} 
Os S nernnere p> 0, 
n=—o 
Prove that: 
(i) 0,(2) = —it~*70%(7); 
(ii) |0,(t)| < eas t + 0 for some positive constant C, ; 
(iii) |0%(2)| < e~ ©" as t > 0 for some positive constant C). 
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(d) Express the Mellin transform of 6, and 6% in terms of €(a, s) and /(a, s); prove 
that €(a, s) — €(1 — a, s) and /(a, s) — (1 — a, s) extend to entire functions of 
séC; and derive a functional equation relating these two functions. 

(e) Suppose that y is a primitive odd character mod N, 1.e., 7(—1) = —1. Prove that 
L(y, s) extends to an entire function of seC, and find a functional equation 
relating L(y, s) to L(Z, 1 — s). 

(f) Let x be an odd quadratic character. Show that if you somehow knew that 
L(y, 1/2) # 0, then this would imply that g(y) = i,/N (rather than —i,/N). 

(g) With x as in part (e), show that L(y, s) = 0 if s is a negative odd integer. 

(h) With x as in part (e), express L’(y, 1 — 2k) in terms of L(x, 2k). In particular, 
express L’(y, —1) in terms of the dilogarithm. For example, express L’(y, — 1), 
where 7(”) = (4), in terms of the dilogarithm. 


6. Let x be an odd primitive Dirichlet character mod N, and define 


m~O= > ny(nye-™ = 5 » ny(nye™”, t>0. 
n=1 neZ 
(Note that this is different from the definition of 0(y, #) for even x in Problem 4.) 
Let 6, and 6? be as in Problem 5(c). 
(a) Prove that: 
N N 
5 L(A) Baw(N 78); 


a=1 


(i) 0%. = 


N 
(i) 5 Y ao" = (OG, 0: 


(iii) O0(y, ) = —iN~21-3? g (QO, 1/N70. 


(b) Show that the Mellin transform of 6(y, 4) converges for any s, and that for 
Re s > Fit is equal to m “T'(s)L(y, 2s — 1). 

(c) Use the functional equation in part (a)(iii) to give another proof of the func- 
tional equation for L(y, s) in Problem S5(e) above. 


7. Let x be the character mod 12 such that y(+1) = 1, x(+5) = —1. Let n(z) Zou 
—iz/12) for Im z > 0. Prove that 4(—1/z) = Jali n(z), where we take the branch of 
Vzii which has value | when z = i. We shall later encounter 4(z) again, and give a 
different expression for it and a different proof of its functional equation. 


8. (a) Use the functional equations derived above to express /(a, 1 — s) in terms of 
C(a, s) and (1 — a, s). 
(b) Use the properties (4.3) and (4.4) of the gamma-function along with part (a) to 
show that 


Ka, 1 — s) = V(s)(2n) Se"? (C(a, 8) + e™E(1 — a, 5). 


(c) ForaeC, a+ —n, define (a + n)~* to mean e*'°8¢*”, where we take the branch 
of log having imaginary part in (—z, 2]. Show that foraeC, Ima>0,Res> 1, 
one has: 


K(a, 1 — s) = V'(s)(22) Seis"? 3 (a+n)*. 


n=—0 


§5. The Hasse—Weil L-function and its functional equation 79 


(d) Let s = k be a positive even integer. Show that for aeC, Ima > 0: 


= 1 Qnit 2 ay 
= n e 
n= sa (a + n)* (k — 1) i 


2nina 


(e) Give a second derivation of the formula in part (d) by successively differentiating 
the formula 
1 


a—n 


1 = 1 
mT cot(za) = — + + 
ve a 2 a+n 


§5. The Hasse—Weil L-function and its functional 
equation 


Earlier in this chapter we studied the congruence zeta-function Z(E/F,; T) 
for our elliptic curves E,,: y? = x? — n?x. That function was defined by a 
generating series made up from the number N, = N,,, of F,,.-points on the 
elliptic curve reduced mod p. We now combine these functions for all p to 
obtain a function which incorporates the numbers WN, , for all possible prime 
powers p’, i.e., the numbers of points on £, over all finite fields. 
Let s be a complex variable. We make the substitution T= p™* in 
Z(E,/F,; T), and define the Hasse~Weil L-function L(E,,, s) as follows: 
C(s)o(s — 1) 
L(E = 
Cw DTT Z E/E P 
1 
= = = (5.1) 
dy deg pT pee? 
= 1 
P}2n 1— apes ?(NP) 


(5.2) 


We must first explain the meaning of these products, why they are equiv- 
alent, and what restriction on se€C will ensure convergence. In (5.1) we are 
using the form of the congruence zeta-function in the theorem in §II.2 (see the 
first equality in (2.7)), where the notation a, , indicates that the coefficient 
a depends on £, and also on the prime p. We put the term C(s)¢(s — 1) in 
the definition so that the uninteresting part of the congruence zeta-function— 
its denominator—disappears, as we see immediately by replacing ¢(s) and 
€(s — 1) by their Euler products (see (3.1)). Note that when p|2n, the deno- 
minator term is all there is (see Problem 10 of §II.1), so we only have a 
contribution of | to the product in that case; so those primes do not appear 
in the product in (5.1). 

In (5.2) the product is over all prime ideals P of Z[i] which divide primes 
Pp of good reduction. Recall that those primes are of two types: P = (p), 
p =3 (mod 4), deg P=2, NP=p?; and P=(a+ bi), a +b*=p=1 
(mod 4), deg P= 1, NP =p. The meaning of op and the equivalence of 
(5.1) and (5.2) are contained in Proposition 1 (see (3.2)). 
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As in the case of the Riemann zeta-function, we can expand the Euler 
product, writing each term as a geometric series and multiplying all of the 
geometric series corresponding to each prime. The result is a Dirichlet 
series, i.e., a series of the form 


L(E,, 8) = Y Bm. m. (5.3) 
m=1 


Before discussing the “‘additive” form of L(£,, s) in detail, let us work out 
ae values of the first few 5,,., for the example of the elliptic curve E,: y? = 
x°* — x. We first compute the first few values of ag, , in (5.1). If p = 3 (mod 4), 
thena,, ,=0.Ifp=1 (noe 4), there are two easy ways to computed = a, , 
(1) as the solution to a? + b? = p for which a + bi = 1 (mod 2 + 21); (2) 
after counting the number N, of F,-points on E,, we have 2a=p+1—N, 
(see (1.5)). Here is the result: 


1 1 1 1 
143-°9% 142-5 545-25 147-49 1411-121- 

é l . l see 
1—6-13-°+13-169°§ 1 —2-17-° + 17-289" 
=1—2-55—3-946-13%4+2-17%°+ Yb, (5.4) 


m>25 


L(E,, 5) = 


We have not yet discussed convergence of the series or product for L(E,, 5). 
Using (5.2) and the standard criterion for an infinite product to converge 
to a nonzero value, we are led to consider Xp|ap|**8?(NP)-* for s real. By 
peueaes 1, we have |ap|*®” = NP*?. In addition, NP'?~* < p’?* for 

4 (where P = (p) or else PP = (p)). Since there are at most two P’s for 
a p, it follows that the sum is bounded by 22, p'?~*, which converges if 
Res > 3. To summarize, the right half-plane of guaranteed convergence is 
1/2 to the right of the right half-plane of convergence for the Riemann 
zeta-function, because we have a term of absolute value JP in the Euler 
product which was absent in the case of ¢(s). 

We now discuss the additive form of L(E,, s) in more detail. Using 
Proposition 2, we can rewrite (5.2) in terms of the map 7, defined in (3.3)— 
(3.4): 


= iAP) \" 
L(E,, 5) ane (: i) ) ; (5.5) 
where we have used ¥,(P) to denote its value at any generator of the ideal P. 
Notice that, since 7, is a multiplicative map taking the value | at all four 
units +1, +i, we may regard it equally well as a map on elements x of 
Z[i] or on ideals J. 

We can now expand the product (5.5) in the same way one does for the 
Riemann zeta-function and for Dirichlet L-series (see Problem 1(a) in the 
last section). We use the two facts: (1) every ideal J has a unique factorization 
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as a product of prime power ideals; and (2) both , and N are multiplicative: 
XL, b) = %,(1) iL), NUL) = NI,-NZ. Then, by multiplying out the 
geometric series, we obtain: 


L(E,, 8) = Lin (ND), (5.6) 
I 


where the sum is over all nonzero ideals of Z[i]. 

A series of the form (5.6) is called a ““Hecke L-series”, and the map J, is 
an example of a “Hecke character”. In a Hecke L-series, the sum on the 
right of (5.6) is taken over all nonzero ideals in some number ring. A multi- 
plicative map x on the ideals in that ring is said to be a Hecke character if 
the following condition holds. There is some fixed ideal f and a fixed set of 
integers n,, one for each imbedding o of the number field into Q***!, such 
that if J is a principal ideal generated by an element x which is congruent to 
1 modulo the ideal f, then x(/) = II,o(x)"*. In our example, the number 
ring is Z[i]; there are two imbeddings o, = identity, 0. = complex conju- 
gation in Gal(Q[7]/Q); we take n, =1, n,,=0; and we take f = (7’) 
(n’ = (2 + 2i)n if n is odd, 2n if n is even). Then the condition simply states 
that 7,((x)) = x if x = 1 (mod n’). 

It is very useful when the Hasse—Weil L-series of an elliptic curve turns 
out to be a Hecke L-series. In that case one can work with it much as with 
Dirichlet L-series, for example, proving analytic continuation and a func- 
tional equation. It can be shown that the Hasse—Weil L-series of an elliptic 
curve with complex multiplication (see Problem 8 of §I.8) is always a Hecke 
L-series. 

The relation between the additive form (5.6) and the additive form (5.3) 
is quite simple. We obtain (5.3) by collecting all terms corresponding to 
ideals J with the same norm, i.e., 


Disa De ta Ds 


Iwith NI=m 


Notice that, since 7,(/) = %, (J) - (x7) by (3.3), we have 


n i n 
ter CF) Salt = Ci) 


where we have denoted 8,, = b,,,. Thus, if for fixed n we let x, denote the 
multiplicative map on Z given by m+> (q) (for m prime to 2), we have 


2 
b= 2(2)s~ = 3(2) Orta: (4) isa 2(4) Var eee 
5 3 B 17 (5.7) 


(note: (3)? is 1 if 3/n and 0 if 3|n); one says that L(E,, s) is a “twisting” 
of L(E,, s) = Xb,,m~* by the character y,. One can verify that for n square- 
free, the conductor of x, is m when n = | (mod 4) and is 4m when n = 2 or 3 


L(E,, 8) = 


| 
Ms 
Pad 
x 
3 
— 
=~ 
3 
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mod 4 (this follows from quadratic reciprocity). In other words, y, is a 
primitive Dirichlet character modulo x or 4n. 

To keep the notation clear in our minds, let us review the meaning of y,,, 
x, and Z,. First, zy, is a map from Z to {+1, 0} which is defined by the 
Legendre symbol on integers prime to 2n. Second, yj, is a map from Z[i] 
to {+1, +i, 0} which takes elements x prime to 2 to the unique power of i 
such that y/,(x)x = 7,(Nx) modulo 2 + 27 (see (3.3)—(3.4)). Thirdly, ¥, is a 
map from Z[i]| to Z[i] which takes an element x to xy7;(x); also, Z,, can be 
regarded as a map from ideals of Z[i] to elements of Z[i] which takes an 
ideal J prime to 2” to the unique generator of J which is congruent to 7,(NJ) 
modulo 2 + 21. 

The character x, is intimately connected with the quadratic field Q(./n). 
Namely, if # = p # 2 is a prime number, then the value of 7,(p) = (¢) shows 
whether p splits into a product of two prime ideals (p) = P,P, in Q(./n) 
(this happens if (3) = 1), remains prime (if (3) = —1), or ramifies (p) = P? 
(if (5) = 0, i.e., p|n). (See [Borevich and Shafarevich 1966].) We say that y, 
is the quadratic character associated to the field Q(,/n). 

It is not surprising that the character corresponding to the field Q(J/n) 
appears in the formula (5.7) which links L(E,, s) with L(E,, s). In fact, 
if we allow ourselves to make a linear change of variables with coefficients in 
Q(,/n), then we can transform E,: y? = x° —n°’x to E,: y? =x — x’ by 
setting y = na/ny’, x = nx’. One says that E,, and £, are isomorphic ‘‘over 
the field Q(/n).” 

Returning now to the expression (5.6) for L(E,, 5), we see that it can also 
be written as a sum over elements of Z[i] rather than ideals. We simply note 
that every nonzero ideal has four generators, and so appears four times if 
we list elements instead of ideals. Thus, 


1 


Ban aaa’ » VAG F bi), 
4 a+ biwitha?+b2=m 
and 
1 ag 
LE, 8) =7 Dy dn(x)(Nx) 
xeZ[i] 


(5.8) 


1 (a + bi)x(a + bi) 
4 atbiezti) (a? + b?)° , 


where y, was defined in (3.3)—(3.4). (The sums are over nonzero x, a + bi.) 

Notice the analogy between the sum (5.8) and Dirichlet L-series. The only 
differences are that the number ring is Z[i] rather than Z, and our Hecke 
character ¥,,(x) includes an ordinary character y/,(x) (with values in the roots 
of unity) multiplied by x. 

We now proceed to show that L(E,, s) can be analytically continued to the 
left of Re s = 3, in fact, to an entire function on the whole complex plane; 
and that it satisfies a functional equation relating L(E,, s) to L(E,, 2 — s). 
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Since L(E,, s) is a ‘‘two-dimensional” sum over Z[i] ~ Z’, i.e., over pairs 
of integers rather than integers, it follows that we shall need to look at 
Fourier transforms, the Poisson summation formula, and theta-functions 
in two variables. We shall give the necessary ingredients as a sequence of 
propositions whose proofs are no harder than the analogous results we 
proved in the last section for the case of one variable. 

Since the definitions and properties we need in two dimensions are just 
as easy to state and prove in n dimensions, we shall consider functions on R”. 
For now, n will denote the number of variables (not to be confused with 
our use of n when writing E,: y? = x? —n?x, x,, etc.). We will use x = 
(x,,.--,X,) and y =(j,, ...,y,) to denote vectors in R”. As usual, we let 
XY SXyVy t+ +XpN_s [x] =/x-x. We shall also use the dot-product 
notation when the vectors are in C”; for example, ifm = 2 we have x: (1, i) = 
Xy Ht Xyi. 

Let ¥ be the vector space of functions f: R" + C which are bounded, 
smooth (i.e., all partial derivatives exist and are continuous), and rapidly 
decreasing (i.e., |x|“f(x) approaches zero whenever |x| approaches infinity 
for any NV). For fe we define the Fourier transform f/: R” > C as follows 
(where dx denotes dx ,dx, --- dx,): 


fo)= | oF), (5.9) 


This integral converges for all ye R”, and fe Y. 


Proposition 9. Let f: R" > C, g: R" > C be functions in Ff. 


(1) IfaeR" and g(x) = f(x + a), then g(y) = erniavé(y), 
(2) IfaeR" and g(x) = e?""**f(x), then G(y) = f(y — a). 
(3) If beER, b > 0, and g(x) = f(bx), then G(y) = b-"f(y/b). 
(4) If f(x) =e-™™, then f = f. 


Proposition 10 (Poisson Summation Formula). Ifge YS, then 


dy gm) = ¥ Gm). 
mez" mez" 

The proofs of Propositions 9 and 10 are completely similar to those of pro- 
perties (1)—(3) of the Fourier transform in one variable and Propositions 5 
and 6 of the last section. One simply has to proceed one variable at a time. 

If weC” and fe Y, we let w- at Wis + Wat Bete op Wales 


Proposition 11. If fe.Y and g = w: &f, then G(y) = 2niw- y f(y). 


Proor. Since both sides of the equality are linear in w, it suffices to prove 
the proposition when w is the j-th standard basis vector, i.e., to prove that 
the Fourier transform of a5; S(x) is 2niy; f(y). This is easily done by sub- 
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stituting ax; a& f(x) in place of f(x) in (5.9) and integrating by parts with respect 
to the j-th ‘variable (see Problem 5(a) in the last section). a 


For the rest of this section, we take n = 2 in Propositions 9-11, and we 
return to our earlier use of the letter n in E,,, x, etc. 


Theorem. The Hasse—Weil L-function L(E,,, s) for the elliptic curve E,: 
y? = x3 —n’x, which for Re s > 3 is defined by (5.1), extends analytically 
to an entire function on the whole complex s-plane. In addition, let 


% 32n?, nodd; 5.10) 
~ )16n?, 1 even. Ct 
Let 
N 
A(s) = (ey T'(s)L(E,, 5). (5.11) 
Then L(E,,, Ss) satisfies the functional equation 
A(s) = +A(2—5), (5.12) 


where the “‘root number”? +1 is equal to 1 ifn = 1, 2, 3 (mod 8) and is equal 
to —1lifn=5, 6, 7 (mod 8). 


ProoF. The proof is closely parallel to the proof of analytic continuation 
and the functional equation for Dirichlet L-series with odd character, which 
was outlined in Problem 5 of the last section. Namely, we express L(E,, 5), 
written in the form (5.8), in terms of the Mellin transform of a two-dimen- 
sional version of the theta-function 6,(t) defined in Problem 5(c). We shall 
use the letter u rather than a to avoid confusion with the use of a in (5.8). 

Thus, let wu = (u,, u.)€R*, where u¢Z?, and let te R be positive. Let w 
be the fixed vector (1, i)€C7, so that, for example, m-w = m, + myi for 
meZ’. We define: 


A) = ¥ (m+ u)-wemintal, (5.13) 
me Z2 

jHH)= Ym en (5.14) 
me Z? 


Regarding wu and ¢ as fixed, we find a functional equation for 6,(t) by 
means of the Poisson summation formula (Proposition 10); to obtain 6,(4) 
on the left side of Proposition 10, we choose 


g(x) = (x + u) we mata, (5.15) 


To find the Fourier transform of g(), and hence the right side of the Poisson 
summation formula, we proceed in several steps, writing f(x) = er, 
g1(x) =f/tx), g2(x) = w+ 9i(x), and finally g(x) = iig2(x + uw). We 
have: 
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fly) =e? by Proposition 9, part (4); 
g,(”) = te" 19! by Proposition 9, part (3); 
go(y) = 2nit™!w-y el by Proposition 11; 
g(y) = —it-2w-y e2t™%e— 99? by Proposition 9, part (1). 


If we now evaluate g(m) for me Z?, and sum over all m, we obtain the func- 
tional equation 


6,(t) = Sor(2), (5.16) 
t t 

We now consider the Mellin transform of 6,(1): |F 1°0,(t)%, and show that 
the integral converges to an entire function of s. First, for large ¢ it is easy 
to bound the integrand by something of the form e“, using the fact that 
|m + u|* is bounded away from zero, since u is not in Z?. Next, for ¢ near 
zero one uses the functional equation (5.16) and a bound for 6“(4) of the 
form e~‘", where we use the fact that the only term in (5.14) with |m|* = 0 
vanishes because of the factor m-w. These bounds make it a routine matter 
to show that the integral converges for all s, and that the Mellin transform 
is analytic in s. 

If we now take Re s > 3, we can evaluate the Mellin transform integral 
term by term, obtaining a sum that begins to look like our Z-function: 


| 9, = » (m+ uw | prernimea 
0 


0 me 2? 


=n T(s) ue 


me Z? |m + u|?s 


(see (4.6)). 


Now for Re s > 3, we can rewrite L(E,, s) as a linear combination of these 
sums with various uw. 

We now suppose that n is odd. The case n = 2m, even is completely similar, 
and will be left as an exercise below. We take w = (1, i). If we use (5.8) and 
recall that x(x) depends only on x modulo n’ = (2 + 2i)n, and hence, a 
fortiori, only on x modulo 4n, we obtain: 


a+ bi+4nm-w 


1 : 
Li, )=- YY yxfatbi) ¥ a. bye aun? 


4 o<a,b<4n me Z2 


i 7 , : (m+ (4,24))-w 
= 7(4n)'~?8 x(a + bi) (mn + Gn an)) 
4 veoreay 2 im ee ee 


Thus 


ce 1 =? ; Be [ei dt 
T T(s)L(£,, Ss) = riGuoe : » Xnla + vi) | t Qajan,ojan(—- (5.17) 
(a,b) 310.0) 


Since the integral inside the finite sum is an entire function of s, as are the 
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functions (4n)'~*5 and ‘/I'(s), we conclude that L(E,, s) has an analytic 
continuation to an entire function of s. 

Moreover, we can transform this integral using the functional equation 
(5.16) and replacing ¢ by +: 


e dt eee 1\ dt A ere dt 
tO nian, vj4n(l)— =; ts cane —=-j] t? sQalan. diane) 
10) t r?) t t t 


10) 
In the entire function (5.17) we now suppose that Re 2 — s > }(i.e., Re s < 4) 
so that we can evaluate this last integral as an infinite sum. Using (4.6) again, 
inserting the definition (5.14), and interchanging summation and integration, 
we obtain 


foe) 

| p2-sgalenbiancyy Ht —_ ns *T(2 = s) > m: Wee re gl ae, 

t 
0 meZ* 


Thus, for Re 2 — s > 3, the right side of (5.17) is equal to 


m:-Ww 


1 
~i(4n)'- nT —y5 y EY s, (5.18) 
Aes |m|?° ) 
where for me Z? 
Suz XL (at bidel2nienmetaty, (5.19) 


O<a,b<4n 


Lemma. [f m, + m,i is not in the ideal (1 +i), then S,,=0; whereas if 
m,+my,i=(14+i)x with xeZ[i], then S,, = 2x,(x)g(x,), where g(x) is the 
Gauss sum defined in Proposition 4 of §I1.3 (see (3.9)). 


Before proving the lemma, we show how the functional equation in the 
theorem follows immediately from it. Namely, if we make the substitution 
m-w=m,+m,i= (1 + i)x in the sum in (5.18), the lemma gives us 

x m-w 2(1 + i)x 


paa-9m = > Trop a) dace LG Xn) 


mez? |M xeZ[i] \(1 + xy 


=(1+ 127 (52) +20 YZ) (Nx 29 
n xeZ[i] 


by Proposition 4. But this last sum is 4Z(E,, 2 — s), by (5.8). Bringing this 
all together, we conclude that for Re 2 — s > } the right side of (5.17) is 
equal to 


— i(4n)'~25n8-2F(2 — s)(1 + i283 (=?) (2 + 2i)nL(E,, 2 — 8) 
(5.20) 
= (>?) n-21 (2 — s)(8n?)! SL(E,, 2 — 5). 


On the other hand, if we bring the term (,/N/2) over to the right in the 
functional equation (5.11)—(5.12) in the theorem, we find that what we want 
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to prove is: 


nT (s)L(E,, 5) = (2) (/N/2)*(22)7(./N)? 1 (2 — s)L(E,, 2 — 5) 


7 (+) (N/4)!8n*-?7F(2 — 5) L(Eq, 2 ~ 3). 


And this is precisely (5.20). 
Thus, to finish the proof of the theorem for odd n, it remains to prove 
the lemma. 


PROOF OF LEMMA. First suppose that m, + m,iis not divisible by 1 + i. This 
is equivalent to saying that m, and m, have opposite parity, i.e., their sum 
is odd. Now as a, b range from 0 to 4n, the Gaussian integer a + bi runs 
through each residue class modulo (2 + 2/)n exactly twice. Each time gives 
the same value of 7/(a + bi), since y,(a + bi) depends only on what a + bi 
is modulo n’ = (2 + 2i)n. But meanwhile, the exponential terms in the two 
summands have opposite sign, causing the two summands to cancel. To see 
this, we observe that if a, + b,iand a, + 5,i are the two Gaussian integers 
in different residue classes modulo 4n but the same residue class modulo 
(2 + 2i)n, then a, + b,i — (a, + 631) = (2 + 2i)n (mod 4n), and so 


e(2ri/4nym-((a;,b,)—(4z,b2)) — e(2xi/4n)m-(2n, 2n) 2. etm, +m) es 1. 


This proves the first part of the lemma. 

Now suppose that m, + m,i = (1 + i)x. Note that m-(a, b) =m,a+my,b 
= Re((m, — m,i)(a + bi)) = Re((1 — i)X(a + bi)). Hence, the exponential 
term in the summand in S,, is ¥(X(a + bi)), where 


— p2ni Re(x/n’) 
W(x) Ga ®& 


(with n’ = (2 + 2i)n). Since x; is a primitive character modulo (2 + 2/)n 
(see Proposition 3), we can apply Problem 9(a)—(b) of §IJ.2. Since the sum- 
mation in (5.19) goes through each residue class modulo (2 + 2/)n twice, 
we have 

m=2 Y Ula t bi) W(Ha + bi) 


a+bieZ[i]/(2+2i)n 
= 27,(%)9 ka) = 2X0) 9(Kn)- 


This proves the lemma, and hence the theorem (except for some slight 
modifications in the case of even n, which will be left as an exercise). Oo 


In the problems we shall outline a proof of the analogous theorem in 
the case of an elliptic curve, namely y? = x? + 16, which has complex multi- 
plication by another quadratic imaginary integer ring, namely Z[w], where 
w = —4+44i,/3. There is one additional feature which is needed because 
we end up summing not over Z[i], which can be thought of as Z?, but 
rather over a lattice which is the image of Z? under a certain 2 x 2-matrix. 
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So we have to apply Poisson summation to a function much like the function 
in this section, but involving this matrix. 

We conclude this section by mentioning two references for a more general 
treatment of the theory of which we have only treated a few special cases. 
First, in C. L. Siegel’s Tata notes [Siegel 1961] (see especially pp. 60-72) 
one finds L-functions whose summand has the form 


e2timu P(m + v) 
(O[m + opr” 


where me Z", u, ve R", Q is the matrix of a positive definite quadratic form, 
and P is a ‘“‘spherical polynomial with respect to Q of degree g”. The case 
we needed for L(E,, s) was:n= 2, 0 =(} 9), P(x,, xX.) =x, +ix,,g=1. 
In Problem 8 below we have the case Q = (1). '7), P(x,, X2) = (@ + 1/2)x, 
+ (w/2 + 1)x (where w = —1/2 + i,/3/2), g = 1. 

In [Lang 1970, Chapters XIII and XIV], two approaches are given to this 
topic. In Ch. XIII, the approach we have used (originally due to Hecke) 
is applied to obtain the functional equation for the Dedekind zeta-function 
of an arbitrary number field. This is a generalization of Problems 2 and 6 
below. However, the case of more general Hecke L-series is not included in 
that chapter. A quite different approach due to J. Tate—using Fourier 
analysis on p-adic fields—is given in Ch. XIV of Lang’s book. 


PROBLEMS 
1. Finish the proof of the theorem for n even. 


2. (a) Find a functional equation for 0(0) = Z,<22 etm? +> 0. 
(b) The Dedekind zeta-function of a number field K is defined as follows: 


xls) = DIN), 


where the sum is over all nonzero ideals / of the ring of integers of K. This sum 
converges for Re s > 1 (see [Borevich and Shafarevich 1966, Ch. 5, §1]). Let 
K = Q(i). Prove that ¢;(s) is an entire function except for a simple pole at s = 1 
with residue 2/4, and find a functional equation relating €x(s) to €x(1 — 5). 


3. For u and v in R?, let 


i ee ence 2 
A(t) = > e2nim ve nt|m+ul| . t>0. 
sd meZ? 


Find a functional equation relating 0/(f) to 0% ,.(4). 


4. (a) In the situation of Proposition 11, express the Fourier transform of (w -2)'f(0) 
in terms of f(y) for any nonnegative integer k. 
(b) Suppose that k is a nonnegative integer, we R? is fixed with u¢ Z?, t > 0 is fixed, 
and w = (1, i)€¢C?. What is the Fourier transform of 


g(x) =((x + u)- w)kennletul? 9 


(c) With k, u, t, w as in part (b), define: 
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Bux) = DL (Um + u)-w)ternimral; 
meZ? 
6*(t) e Y (m : w)ke2nim went? 
me Z? 
Find a functional equation relating 6, ,(t) to 0“*(4). 

(d) Suppose that / is a fixed ideal of Z[i], and x: (Z[i]/1)* > C* is a nontrivial 
character. Outline (without computing the details) how one would prove that 
the function 2, . zy XXX) (Nx) extends to an entire function of s (if x were 
trivial, there would be a simple pole at s = 1), and satisfies a functional equation 
relating its value at s to its value atk + 1—s. 

(e) Explain why any Hecke L-series for Z[i] is essentially of the form in part (d). 


5. Let f: R" > C, fe. 

(a) For MeGL,(R), let M‘ denote the transpose matrix, and let M* = (M~')'. Let 
g(x) = f(Mx). Express g(y) in terms of f(y). 

(b) Let L be a lattice in R"; equivalently, L is of the form L = MZ", where Me 
GL,(R), i.e., L is obtained by applying some matrix M to the elements in the 
standard basis lattice Z". Let L’ be the ‘‘dual lattice” defined by: L’ = {ye 
R"|x-yeZ for all xe L}. First show that L’ is a lattice. Then prove a functional 
equation relating 2,., f(x) to Eee); 


6. Letw = -$+4,/3. 

(a) Let M=(, (7°). Show that the lattice L = Z[w] < C = R? is L = MZ”. What 
are M* and L’? Show that L’, considered as a lattice in C, is twice the “different” 
of Z[w], which is defined as {x € Q(w)|Tr(xy) eZ for all ye Z[w]}. Here Tr x = 
x+x=2Rex. 

(b) Prove that 


Y ~at|x|? 2 Y eo (4n/30)|x/? 
xeZ[w] £/3 ecatal 


(c) Let K = Q(w). Prove that (,(s) extends meromorphically onto the complex s- 
plane with its only pole a simple pole at s = 1. Find the residue at the pole, and 
derive a functional equation for €x(s). 

(d) With K as in part (c), prove the identity: ¢,(s) = C(s)L(y, 5s), where x is the 
nontrivial character of (Z/3Z)*. 

(e) Use part (d), the theorem in §II.4, and Problem 5(e) of §II.4 to give another 
proof of the functional equation in part (c). 


7. Let E be the elliptic curve y? = x? + 16. Letw = —$ + $3. 

(a) Show that the reduction mod p of y? = x° + 16 is an elliptic curve over F, if and 
only if p # 2, 3. For such p, recall that the Euler factor at p is (1 — 2a,p * + 
p'~*5)"!, where (1 — 2a,T + pT”) is the numerator of Z(E/F,; T). Show that 
for p # 2, 3 this factor is 


[Td = af? ), 
P 


where the product is over all (one or two) prime ideals P of Z[w] dividing (p) 
and a$°#” is the unique generator of P such that «#8? = | (mod 3). 
(b) When p = 3, define the Euler factor at 3 to be simply 1, as we did for E,,: y? = 


x? — n?x when p|2n. When p = 2, at first glance it looks like we again have 
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y? = x°, and so we might be tempted to take 1 for the Euler factor at 2 as well. 
However, this is wrong. When one defines L(E, s), if there exists a Q-linear 
change of variables that takes our equation E: y? = x* + 16 in PZ to a curve C 
whose reduction mod p is smooth, then we are obliged to say that E has good 
reduction at p and to form the corresponding Euler factor from the zeta-func- 
tion of C over F,,. In Problem 22 of §I1.2, we saw that y? = x* + 16 is equivalent 
over Q toy? + y = x3, asmooth curve over F, whose zeta-function we computed. 
Show that the Euler factor at p = 2 is given by the same formula as in part (a). 

(c) For xeZ[a@] prime to 3, let y(x) = (—w) be the unique sixth root of | such that 
x(x) = 1 (mod 3). Let (x) = 0 for x in the ideal (,/— 3). Show that 


1 x(x) x 
L(E, s) == : 
: ” 6 gay (NX)! 
(d) Let 


= p(2ni/3) Tr(xfiv3) 
Wix)=e 


where Trx=x+xX=2 Rex. Verify that W(x) is an additive character of 
Z[q@]/3 satisfying the condition in Problem 9 of §II.2 (i.e., that it is nontrivial on 
any larger ideal than (3)). Find the value of g(x, W) = Zxezpoy3 4) (x), where 
y is as in part (b). 


8. (a) Let w = (1, w), we R? — Z?, and t > 0 be fixed. What is the Fourier transform 
of g(x) = (x + u)- we mato wl?9 
(b) Let 6,(2) = 2,<z2g(m), where g(x) is as in part (a). Find the Mellin transform 
p(s) of 
Y x(a + baw) 64,(t). 


u=(a/3,b/3) 
O<a,b<3 


(c) Prove that L(E, s) is an entire function, where E: y? = x°* + 16 is the elliptic 
curve in Problem 7. 
(d) Prove the functional equation A(s) = A(2 — s) with 
Jai 


A(s) (=) T(s)L(E, s). 
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The value at s = 1 of the Hasse—Weil L-function L(E, s) of an elliptic curve 
Eis called the ‘‘critical value’. When we have a functional equation relating 
L(E, s) to L(E, 2 —s), the point s=1 is the ‘“‘center” of the functional 
equation, in the sense that it is the fixed point of the correspondence s>2 — s. 
The importance of this critical value comes from the following famous 
conjecture. 


Conjecture (B. J. Birch and H. P. F. Swinnerton-Dyer). L(E, 1) = 0 if and 
only if E has infinitely many rational points. 


In this conjecture E is any elliptic curve defined over @. In the general 
case it has not even been proved that it makes sense to speak of L(E, 1), 
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because no one has been able to prove analytic continuation of L(E, s) to 
the left of the line Re s = 3. However, analytic continuation and a functional 
equation have been proved for any elliptic curve with complex multiplication 
(see Problem 8 of §1.8), of which our E,, are special cases, and for a broader 
class of elliptic curves with a so-called ‘Weil parametrization”’ by modular 
curves. (It has been conjectured by Weil and Taniyama that the latter class 
actually consists of all elliptic curves defined over Q.) 

We shall call the above conjecture the “weak Birch--Swinnerton-Dyer 
conjecture’’, because Birch and Swinnerton-Dyer made a much more detailed 
conjecture about the behavior of L(E, s) at s = 1. Namely, they conjectured 
that the order of zero is equal to the rank r of the group of rational points 
on E (see the beginning of §1.9). Moreover, they gave a conjectural descrip- 
tion of the coefficient of the first nonvanishing term in the Taylor expansion 
at s = 1 in terms of various subtle arithmetic properties of £. For a more 
detailed discussion of the conjecture of Birch and Swinnerton-Dyer, see 
[Birch 1963], [Birch and Swinnerton-Dyer 1963, 1965], [Cassels 1966], 
[Swinnerton-Dyer 1967], [Tate 1974]. 

There is a simple heuristic argument—far from a proof-—which shows 
why the weak Birch—Swinnerton-Dyer conjecture might be true. Let us 
pretend that the Euler product for L(E, s) (see (5.1) for the case EF = E£,) 
is a convergent infinite product when s = 1 (which it probably isn’t). In that 
case we would have: 


= Tye 


1 
L(E, 1)= 
( ) I] ae = 


pi 2a, Pp 


1-2s ~ =|] 
s=1 

where N, = N,_,=p+ 1 —2a,,, is the nuinber of F,-points on the elliptic 
curve E considered modulo p. Now as p varies, the N, “straddle” p at a 
distance bounded by Delp: This is because 2a, , = %, + &,, and the reciprocal 
roots a, have absolute value Jp (see (2.7) for E = E,,, and the discussion of 
the Weil conjectures in §1 for the general case). Thus, roughly speaking, 
Ni, p+ Vp. If N, spent an equal time on both sides of p as p varies, one 
could expect the infinite product of the p/N, to converge to a nonzero limit. 
(See Problem | below.) If, on the other hand, the N, had a tendency to be 
on the large side: N, > p+ le then we would Gbtain L(E, 1) TI, p/ 
(p + Jp) =, 1/0 +p) = 

To conclude this heuristic aeate we point out that, if there are 
infinitely many rational points, one would expect that by reducing them 
modulo p (as in the proof of Proposition 17 in §1.9) we would obtain a large 
guaranteed contribution to N, for all p, thereby ensuring this lopsided 
behavior N, = p + SB On the. other hand, if there are only finitely many 
rational points, then their contribution to N, would be negligible for large p, 
so that N, would have the “random” behaves Noe pt a D: Needless to say, 
this heuristic argument is not of much help in trying to prove the weak 
Birch—Swinnerton-Dyer conjecture. 

However, there is a remarkable result due to D. Goldfeld which states the 
following [Goldfeld 1982]: if the infinite product [[, (N,/p) does converge to 
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a nonzero limit, then (a) that limit is J2/L(E, 1), i.e., it is equal not to the 
reciprocal of the critical value (as in the above heuristic argument) but rather 
to the reciprocal of the critical value multiplied by nee and (b) the analogue 
of the Generalized Riemann Hypothesis holds for the Hasse— Weil L-function 
L(E, S). Because of part (b), no one expects to be able to prove convergence 
of [[,,(N,/p) any time soon (even for a single specific case). 

The really convincing evidence to support the conjecture of Birch and 
Swinnerton-Dyer comes not from the above heuristics or computational 
examples, but rather from a series of dramatic partial results in the direction 
of the conjecture. The first breakthrough came in 1977, when Coates and Wiles 
proved that for a large class of elliptic curves, an infinite number of rational 
points implies that L(E, 1) = 0. (Other partial results will be discussed briefly 
at the end of the book.) 

Recall from Problem 8 of §1.8 that an elliptic curve is said to have complex 
multiplication if its lattice is taken to itself under multiplication by some 
complex numbers other than integers. 


Theorem (J. Coates and A. Wiles). Let E be an elliptic curve defined over Q 
and having complex multiplication. If E has infinitely many Q-points, then 
L(E, 1) = 0. 


The proof of this theorem is rather difficult (see [Coates and Wiles 1977]), 
and it will not be given here. (The original proof further assumed that the 
quadratic imaginary field of complex multiplication has class number 1, but 
this turned out not to be necessary.) 

Since our curves E, have complex multiplication, the Coates—Wiles 
theorem applies, and, in view of Proposition 18 of Chapter I, tells us that 
if L(E,, 1) #0, then n is not a congruent number. Conversely, if we allow 
ourselves the weak Birch—Swinnerton-Dyer conjecture, then it follows that 
L(E,,, 1) = 0 implies that n is a congruent number. 

There is one situation in which it is easy to know that L(E,, 1) = 0. 
Recall that the “root number”—the plus or minus sign in the functional 
equation for L(E,, s)—is equal to (=?) for n odd, and (j.) for n = 2ng even 
(see the theorem in §5). 


Proposition 12. /f n = 5, 6 or 7 (mod 8), and if the weak Birch—Swinnerton- 
Dyer conjecture holds for E,,, then n is a congruent number. 


Proor. According to the theorem in §5, if m = 5, 6, 7 (mod 8), then A(s) = 
—A(2 — s), where A(s) is given by (5.11). Substituting s = 1, we conclude 
that A(1) = —A(1), i-e., AC) = 0. But by (5.11), AC) differs from L(E,, 1) 
by a nonzero factor (namely, JN/2n). Thus, L(E,, 1) = 0, and the weak 
Birch—Swinnerton-Dyer conjecture then tells us that £,, has infinitely many 
Q-points, i.e., by Proposition 18 of Chapter I, 7 isa congruent number. O 


In certain cases, the claim that all » = 5, 6, 7 (mod 8) are congruent num- 
bers has been verified without assuming the weak Birch-Swinnerton-Dyer 
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conjecture. A method due to Heegner (see [Birch 1975]) for constructing 
points on £, enables one to prove this claim for n equal to a prime or twice 
‘a prime. That is, if 7 is a prime congruent to 5 or 7 modulo 8, or twice a 
prime congruent to 3 mod 4, then 7 is known to be a congruent number. 

It is interesting to note that even in the cases when Heegner’s method 
ensures us that 1 is a congruent number, the method does not give us an 
effective algorithm for constructing a nontrivial rational point on E,, or 
equivalently, finding a right triangle with rational sides and area n. 

Somewhat later, Gross and Zagier were able to improve greatly upon 
Heegner’s method. As a special case of their results, they showed that for 
n= 5, 6, 7 (mod 8) the elliptic curve E, has infinitely many rational points 
provided that L(E,, s) has only a simple zero at s = 1, ie., L’(E,, 1) #0. 
This result represents substantial progress in making Proposition 12 un- 
conditional. Moreover, their method is constructive, i.e., it gives you a 
rational point on the curve (equivalently, a right triangle with area n) when 
L(E,, 1) 4 0. See [Gross and Zagier 1983 and 1986] and [Coates 1986]. 

In the cases when the root number is +1, we cannot be sure in advance 
whether L(E,, 1) is zero or nonzero. So in those cases it is useful to have an 
efficient algorithm for computing L(E,, 1), at least to enough accuracy to 
know for certain that it is nonzero. (It is harder to be sure of ourselves in 
cases when the critical value seems to be zero.) We cannot use the series (5.3) 
or (5.8) to evaluate L(E,, 1), since they only converge when Re s > 3. 

So we now turn our attention to finding a rapidly convergent expression 
for L(E,, 1). 

Let us return to the functional equation for L(E,, s), and give a slightly 
different, more efficient proof. Recall that 


l = -s 
L(Ey. 5) = 4 Dy da(x)(Nx) 
xeZ[i] 
with 7,(x) = xy,(x), where y, was defined in (3.3)—(3.4). Suppose we ask 
the question, ““What function F(E,, 1) has 2 ‘T(s)L(E,, s) as its Mellin 
transform?’’ By our usual method using (4.6), we see that the answer is 


FEp =D ale”, (6.1) 
a 4 Hi) 
We now proceed to find a functional equation for F(E,, ¢), which will then 
immediately give us once again our functional equation for the Mellin 
transform L(E,, s). The only difference with our earlier derivation is whether 
we take the character sum before or after applying the functional equation 
(compare with the two derivations in Problems 3 and 5 of §IJ.4 and in 
Problems 4 and 6 for the functional equation for a Dirichlet L-series). 
Recall that 7;,(x) is a primitive character of (Z[i ]/n’)*, where n’ = (2 + 2i)n 
for n odd, n’ = 2n for n even. Let a+ bi run through some set of coset 
representatives of Z[i] modulo n’, and for eacha + bidefine a corresponding 
pair (w,, u) of rational numbers by: u, + u,i = (a + bi)/n’. Replacing x by 
a+ bi+n(m-(1, i)) for meZ?, and setting 
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os ae 
le Nery (6.2) 
(see (5.10)), we can rewrite F(E,, t) as follows: 
FE, 0=— YD xlatoi) Y m+w-(1, deminer, 


4 at+bieZ[i]/n’ meZ2 


If we replace ¢ by yz, the summand in the inner sum becomes 6,(4) in the 
notation of (5.13). We then use the functional equation (5.16) for 6,(+). 
As a result we obtain: 


1\ w . 
F(E,—)=" ¥ x(a t bi)(—it?) ¥ m-(1, e2tmte ml 
N't 4 atti ™ 


= —3tn Ym (1, ie"? LY xia + bile, 


atbi 


Now m-u= Re((m, — m,i)(u, + uyi)) = Re((m, — m,i)(a + bi)/n’). We 
now use Problem 2 of §II.2 to rewrite the last inner sum as 


Xm, ine mi) Y ya Be bi)e2 Relat bin’) 
atbi 


But 7,(m, — mzi) = x,(m, + m,i), and the sum here is the Gauss sum g(y7/) 
evaluated in Proposition 4 (see (3.9)). We finally obtain: 


1 i , = on ety 
F(E, aa) = —grnen ) ee %,(m, + mie tm, 


where the ¢ is (7) for n odd, i(;3) for n = 2no even. Replacing m, + myi by 
xeZ[i], we see that the sum is precisely 4F(E,, t). Thus, we have 


= N’t?F(E,, t), nodd; 


1 

F( E,,-—]= 6.3 

Loe . 

(= N't?F(E,, t), n= 2ngeven. 
No 


We can easily derive the functional equation for L(E,, s) from (6.3). We 
shall write + to denote (+7) for n odd, (2) for n = 2ng even. We have 


© dt 1 [{* 1 \ dt 
“s a SF(E,, th— = +— ts-2F | E, — | — 
n*T(s)L(E,, 8) I, “F(E,, )— = +5, ( ( " wa) 4 


by (6.3). Making the change of variables u = 4, we obtain: 


d 
n°T(s)L(E, 8) = N75 | uw-F(E,, uw) 
10) 


= +N ~'75-?T(2 — s)L(E,, 2 — 5). 


Finally, replacing N’'~* by (./'N/2)2-25 and multiplying both sides by 
(/'N /2)°, we obtain the functional equation of (5.11)—(5.12). 
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We now use our function F(E,, t) in the case when the root number is +1, 
ie. n = 1, 2, 3 (mod 8), in order to find a convenient expression for L(E,, 1). 
Thus,- suppose that 


1 
—_ |= N’t?F(E,, t). 6.4 
F (Ex a) t ( n? t) ( ) 


We use this functional equation to break up the Mellin transform of F(E,, t) 
into two integrals from =+= to oo. The point =f; is the “center” of the func- 
tional equation, i.e., the fixed point of the correspondence f © 3. 


We have: 


oo ra) 1//N” 
nT (s)L(E,, 5) sl vF(E,, as =| + | UF(E,, oe. 
(0) 1//N" (0) : 
In the second integral we replace ¢ by 4;, and then use (6.4) to write F(E,, #5) 
in terms of F(E,, t). The result is: 


n T(s)L(E,, 5) = (OF(E,, t) + NS? -SF(E,, py 
1/VN™ 
Now set s = 1. Multiplying both sides by 2, we immediately obtain: 
L(E,, 1) =2n | F(E,, pdt. (6.5) 
1/VNv 


Recall that the Hasse—Weil L-function can be written as a Dirichlet series 


L(E,, 8) = ¥ On nm’, where by» =% Y Xn(X)- (6.6) 
m=1 xeZ[i] 


Nx=m 


Comparing with the definition (6.1) of F(E,, t), we see that 
HE eh Sb. ge, (6.7) 
m=1 


We can now substitute the series (6.7) into (6.5) and integrate term by term. 
(Notice that the procedure below will work only because we have a positive 
lower limit of integration in (6.5); if we tried directly to use the Mellin 
transform, in which the lower limit of integration is 0, we would not have 
convergence.) Using the formula |? e~“dt = te“ with a= N’'?, c= 2m, 
we immediately obtain the following rapidly convergent infinite series for 
L(E,, 1). 


Proposition 13. The critical value of the Hasse—Weil L-function of the elliptic 
curve E,: y? = x° — n’x for squarefree n = 1, 2, 3 (mod 8) is given by: 


= We 2n./2, nodd; 
L(E,, 1) = 2 y Saher, where v=} n/2 
m=1 


6.8 
2n, n even, Oe) 


Here the coefficients b,., are the Dirichlet series coefficients obtained by 
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expanding 
L(E,, 8) = [] (i — 2ag, pp t+ py = Yo By nt 
pt2n m=1 


In addition, the absolute value of the coefficient b,,, is bounded by do(m),/m, 
where G9(m) denotes the number of divisors of m. 


Proor. We have already proved all except for the bound on 4,, ,. If we write 
the Euler factor in the form (1 — a,p *)"*(1 — &,p°)”', expand each factor 
in a geometric series, and collect coefficients of p~* for each positive integer 
e, we find that the coefficient of p™® is ag + af -'G, + af 70) +--+ + HE. 
This means that, if 7 has prime factorization m = pf: - - - pfér, then 


— ; ej eile sae asi 
Dre ul (gj + of ay, +--+ + Gp). 
Since |«,| = |%,| = \/p for all p, we immediately obtain the bound 
[Bra.n| Ss I] (e + 1) p#i? = do(m),/m, 
j=l 


where we have used the easy fact from elementary number theory that ¢9(™m) 
is the product of the (e; + 1). This completes the proof. Oo 


The bound for 4,,., is useful in estimating the remainder after we compute 
the series in (6.8) out to the M-th place. In particular, if we find that the 
remainder is less than the value of that partial sum, we may conclude that 
L(E,, 1) #90. 

As an example, we treat the case n = 1. The first few Dirichlet series 
coefficients 5,, for L(E,, s) are given in (5.4). By (6.8), we have 


L(E,, 1) 
= 2(e m2 _ 2e~ Sn/av2 2. dps nav? 4 eT 1 3t/2v2 A Bre} Inl2v2 eee) 
= 0.6555143... + Ros, 


where we have denoted Ry = 22,54 "me 
A very crude estimate for a9(m) is 2,/m (see the problems). Thus, 


—nxm/2/2 


= 4 = 
—nm/2/2 _ —aM/2/2 
[Rul s 72 i re : 


So R,; is bounded by 5.2 x 1071”. Actually, the convergence is so fast that 
we only needed to evaluate the first term to show that L(E,, 1) 40: 
L(E,, 1) = 0.6586... + Rs, with |Rs| < 0.023. 


This computation, together with the Coates—Wiles theorem, then tells us 
that | is not acongruent number. In fact, this argument undoubtedly qualifies 
as the world’s most roundabout proof of that fact, which was proved by 
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Fermat more than three hundred years ago. (See [Weil 1973, p. 270 of Vol. 
III of Collected Papers]; see also Problem 3 of §I.1.) 

The next topic we take up is the systematic study of functions such as 
theta-series which have certain types of functional equations under tr++ 
and similar changes of variable. Such functions are called “modular forms”. 
Actually, modular forms are functions of the form 26,,e?"" rather than 
x5,,e ™, but the simple substitution t = —2iz will transform our theta-series 
from this chapter into what turns out to be modular forms. 

In studying modular forms, we will at the same time be approaching elliptic 
curves from another perspective. But these two aspects of elliptic curves—the 
congruence zeta-function and Hasse—Weil L-series, and the theory of 
modular forms—have combined in recent years to form a richly interlocking 
picture. 


PROBLEMS 


1. In the heuristic argument for the weak Birch-Swinnerton-Dyer conjecture, make the 
following ridiculous assumptions: (i) |2a,, — 1| = /p; and (ii) (2ag,, — 1)/,/p = +1 
is “evenly”’ distributed, and happens to coincide with the value at p of a fixed quadra- 
tic Dirichlet character y(p) = (8) for some fixed N. (One of the reasons why these 
assumptions are ridiculous is that 2a, , is an integer.) Show that then L(E£, 1) is 
equal to the value L(y, 4) of the Dirichlet L-function at the center of symmetry of its 
functional equation. 


2. Prove that if the root number in the functional equation for L(E,, 5) is 1, then 
L(E,, 5) has either a nonzero value or else an even-order zero at s = 1; and if the 
root number is —1, then L(£,, s) has an odd-order zero at s = 1. 


3. In the notation of Proposition 13 (here we abbreviate b,, = by. 1, 4p = 4g, p)» Prove 
that: 
(a) 6, = 2a, if pf{2n; b, = 0 if p|2n; 
(6) Bm, my = Om, 2m, ifm, and m, are relatively prime; 
(c) bye+1 = 2a,b,e — pb,e-1 for e > 0 (here take b,,, = 0 when e = 0). 


4. Prove that o9(m) < 2,/m for all m, and that m~*a(m) — 0 as m - oo for any positive 
é. 


5. Compute L(E3, 1) and L(E3, 1) to about three decimal places of accuracy, verifying 
that they are nonzero. 


6. Prove that L(E,9, 1) #0. 


7. Suppose you knew a lower bound c for the absolute value of all nonzero L(E,, 1), 
n= 1, 2,3, ... squarefree. (No such c is known.) For n very large, what is the order 
of magnitude of M such that you could determine from the first M terms in (6.8) 
whether or not L(E,, 1) = 0? 


8. (a) Write a flow chart for a computer program that evaluates L(E,, 1) through the 
M-th term of (6.8) and estimates the remainder. 
(b) If you have a computer handy, use part (a) to find L(E,,, 1) to three decimal 
places. 


CHAPTER III 
Modular Forms 


Our treatment of the introductory material will be similar to that in Serre’s 
A Course in Arithmetic (Chapter VI), except that we shall bring in the 
“level” from the very beginning. 
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For any commutative ring R, the “general linear group” GL,(R) is defined 
to be the set of matrices g = (¢ 5) such that det g = ad — bc is in R* (the 
multiplicative group of invertible elements of R). It is easy to see that GL,(R) 
is a group. The “‘special linear group” SL,(R) is defined to be the subgroup 
of GL,(R) consisting of matrices of determinant 1. In this section we shall 
be concerned with the cases R = R (the real numbers), R= Z, R= Z/NZ 
(for a positive integer NV). 

Let € denote C u {co} (i.e., the complex plane with a point at infinity, or 
equivalently the complex projective line Pi, also known as the “Riemann 
sphere’’). Given an element g = (4 °)e SL,(R) and a point zEC, we define 


pe tb. 
def cz +d” 


gx =a/c = lim gz. (1.1) 


(Thus, g(—d/c) = 00, and if c= 0, then goo = 00.) These maps z+» gz are 
called “fractional linear transformations” of the Riemann sphere C. It is 
easy to check that (1.1) defines a group action on the set €, in other words: 
91(922) = (9192)z for all g;, ggeSL2(R), zEC. 

Notice that for g = —I=(~} _°)e SL,(R), (1.1) gives the identity map. 
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But +/are the only matrices which act trivially on C, as we see by supposing 
that z = (az + b)/(cz + d) for all z, ie., cz? + (d—a)z —b=0 for all z, 
which implies that b = c = 0 and a = d; but the only scalar matrices (§ ?) of 
determinant 1 are +/. Thus, the quotient group SL,(R)/+J, which is 
sometimes denoted PSL,(R), acts “faithfully” on C, in other words, each 
element other than the identity acts nontrivially. 

Let H < C denote the upper half-plane H = {zeC|Im z > 0}. It is impor- 
tant to note that any g € SL,(R) preserves H, i.e., Im z > Oimplies Im gz > 0. 
This is because 


az+b es (az + b)(cZ +d) _ 


wea lee ad? = |cz + d|~* Im(adz + bcz). 


Im gz = Im 
But Im(adz + bcz) = (ad — bc)Im z = Im z, since det g = 1. Thus, 
b 
Im gz=|cz+a|-?Imz for g= (: je SEAR (1.2) 
c 


Thus, the group SL,(R) acts on the set H by the transformations (1.1). 

The subgroup of SL,(R) consisting of matrices with integer entries is, 
by definition, SL,(Z). It is called the “full modular group’’, and is sometimes 
denoted I’. We shall denote T = = I/41. (Whenever we have a subgroup G 
of SL,(R), we shall let G denote G/+/ if G contains —/; if —I¢G, we set 
G =G.) This group I = SL,(Z)/+/ acts faithfully on H. It is one of the 
basic groups arising in number theory and other branches of mathematics. 

Besides T = SL,(Z), certain of its subgroups have special significance. 
Let N be a positive integer. We define 


T(N) = i: je St) = d= 1 (mod N), b = ¢ = 0 (mod mh. 
tile d 
(1.3) 


This is a subgroup of T = SL,(Z), actually a normal subgroup, because it 
is the kernel of the group homomorphism from I to SL,(Z/NZ) obtained 
by reducing entries modulo N. In other words, ['(N) consists of 2 x 2 
integer matrices of determinant 1 which are congruent modulo N to the 
identity matrix. Note that (1) = IT. We shall later see that [(4) is analogous 
to the subsemigroup 1+ NZ cZ consisting of integers congruent to 1 
modulo N. I'(N) is called the ‘‘principal congruence subgroup of level N”’. 

Notice that (2) = T'(2)/+/, whereas for N > 2 we have T(N) = I(N), 
because —1 # 1 (mod N), and so —/is not in T'(N). 

A subgroup of I (or of I’) is called a “congruence subgroup of level N” 
if it contains F(N) (or P(N), if we are considering matrices modulo +/). 
Notice that a congruence subgroup of level N also has level N’ for any 
multiple N’ of N. This is because [(N) > T'(N’). We say that a subgroup of 
I (or of I) is a “congruence subgroup” if there exists N such that it is a 
congruence subgroup of level N. Not all subgroups of I are congruence 
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subgroups, but we shall never have occasion to deal with non-congruence 
subgroups. 
For our purposes the most important congruence subgroups of I are: 


ria {(¢ jet le = 0 (od wy (1.4) 


b 
TN) = i(: ie To(N)|a = 1 (mod wy}. (1.5) 


It is easy to check that these are, in fact, subgroups. Note that if (4 ‘)e 
IT, (N), then since ad — bc = 1 and Nc, it follows that ad = 1 (mod N), and 
hence d= 1 (mod N). We sometimes abbreviate the definitions (1.3)—(1.5) 
as follows: 


rw)={(j 1) moan. r= (5 ;) moa wt: 
Ta(N) = 1 ‘) es vt. 


where * indicates the absence of any congruence condition modulo N. 

Whenever a group acts on a set, it divides the set into equivalence classes, 
where two points are said to be in the same equivalence class if there is an 
element of the group which takes one to the other. In particular, if G is a 
subgroup of I’, we say that two points z,, z,€ H are ““G-equivalent”’ if there 
exists géG such that z, = gz,. 

Let F be a closed region in H. (Usually, F will also be simply connected.) 
We say that F is a “fundamental domain” for the subgroup G of I’ if every 
z€H1 is G-equivalent to a point in F, but no two distinct points z,, z, in the 
interior of F are G-equivalent (two boundary points are permitted to be 
G-equivalent). 

The most famous example of a fundamental domain is shown in Fig. 
IIl.1: 


F = {zeH|—} < Rez <4 and |z| > 1}. (1.6) 


Proposition 1. The region F defined in (1.6) is a fundamental domain for T. 


Proor. The group = SL,(Z) contains two fractional linear transformations 
which act as building blocks for the entire group: 


1 1 
wae Jere 1; 


(1.7) 
sal) geet 
def ] 0 
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CS) oll ol 


Figure III.1 


To prove that every ze H is [-equivalent to a point in F, the idea is to 
use translations T/ to move a point z inside the strip —} < Rez <}. If it 
lands outside the unit circle, it is in F. Otherwise use S to throw the point 
outside the unit circle, then use a translation 7* to bring it inside the strip, 
and continue in this way until you get a point inside the strip and outside the 
unit circle. We now give a precise proof. 

Let ze be fixed. Let I’ be the subgroup of I generated by S and T 
(we shall soon see that actually lr = +I’). If y=(¢ 2)eT’, then Im yz = 
Im z/|cz + d|? by (1.2). Since c and d are integers, the numbers |cz + d| 
are bounded away from zero. (Geometrically, as c and d vary through all 
integers, the complex numbers cz + d run through the lattice generated by 
1 and z; and there is a disc around 0 which contains no nonzero lattice 
point.) Thus, there is some y = (% 5)€T’ such that Im yz is maximal. Replac- 
ing y by T“y for some suitable j, without loss of generality we may suppose 
that yz is in the strip —4 < Re yz < }. But then, if yz were not in F, ie., if 
we had |z| < 1, then, by (1.2), we would have: 


Im Syz = Im yz/|yz|? > Im yz, 


which contradicts our choice of yeI’ so that Im yz is maximal. Thus, there 
exists ye I’ such that yze F. 

We now prove that no two points in the interior of F are T'-equivalent. 
We shall actually prove a more precise result. Suppose that z,, z,¢F are 
T-equivalent. Here we are not supposing that z, and z, are necessarily 
distinct or that they are necessarily in the interior of F. Without loss of 
generality, suppose that Im z, > Imz,. Let y = (¢ °)eT be such that z, = 
yz,. Since Im z, > Im 2, by (1.2) we must have |cz, + d; < 1. Since z, is in 
F and d is real (in fact, an integer), it is easy to see from Fig. IJI.1 that this 
inequality cannot hold if |c| > 2. This leaves the cases: (i) c=0, d= +1; 
(ii) c= +1, d=0, and z, is on the unit circle; (iii) c= d= +1 and z, = 
—14 53; (v)c¢ = —d= +landz, =4+4~5*. In case (i), either y or —y 
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is a translation 7’; but such a y can take a point in F to another point in F 
only if it is the identity or if j= +1 and the points are on the two vertical 
boundary lines Re z = +4. In case (ii), it is easy to see that y = +7°S with 
a = Oand z, and z, on the unit circle (and symmetrically located with respect 
to the imaginary axis) or with a= +1 and z, =z, = +44+~55. In case 


(iii), y can be written as +7? ~{), and if such a map takes z,¢€F to z,€F 


we must have a = 0 and z, =z, = —4+ 4 orelsea=1 andz,=2z,+1 
= 44 ~53. Case (iv) is handled in the same way as case (ili). We conclude 
that in no case can z, or z, belong to the interior of F, unless +y is the 


identity and z, = z,. This proves the proposition. o 


In the course of the proof of Proposition 1, we have established two other 
facts. 


Proposition 2. Two distinct points z,, Zon the boundary of F are Y-equivalent 
only if Rez, = +4 and z, =z, +1, or if z, is on the unit circle and z, = 3. 

In the next proposition, we use the notation G, for the “isotropy subgroup”’ 
of an element z in a set on which the group G acts: by definition, G, = 
{geGl|gz = z}. 


Proposition 3. If ze F, then T, = +1 except in the following three cases: 


@) P= +{1, S} ifz =i; 
di) F = +{J, ST, (ST)*} ifz=a= 
(iii) P= +4, TS, (TS)} fz = —@=$4 BB. 


Both Propositions 2 and 3 follow from the second part of the proof of 
Proposition 1. 

Notice that S* = —/, (ST)? = —J, and (TS)? = —/. Thus, in T = 
SL,(Z)/ +] the elements S, ST, TS generate cyclic subgroups of order 2, 3, 3, 
respectively ; and these subgroups are the isotropy subgroups of the elements 
i, w, —@, respectively. These points in H with nontrivial isotropy subgroups 
are called “elliptic points”’. 

Another by-product of the proof of Proposition | is the following useful 
fact. 


Proposition 4. The group T = SL,(Z)/+1 is generated by the two elements S 
and T (see (1.7)). In other words, any fractional linear transformation can be 
written as a “‘word” in S (the negative-reciprocal map) and T (translation by 1) 
and their inverses. 


Proor. As in the proof of Proposition 1, let I’ denote the subgroup of I 
generated by S and 7. Let z be any point in the interior of F (e.g. z = 21). 
Let g be an element of I’. Consider the point gzéH. In the first part of the 
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proof of Proposition 1, we showed that there exists y€I’ such that y(gz) € F. 
But since z is in the interior of F, it follows by Propositions 1 and 3 that 
yg=+ie.,g = +y 'eI”’. This shows that any geI is actually (up to a 
sign) in I’. The proposition is proved. Oo 


Thus, any element in I can be written in the form S"7"S°7T" .-- 
S%T*!, where all of the a;, b; are nonzero integers, except that we allow a, 
and/or b, to be zero; and, since S? = —J, we may suppose that all of the a; 
equal 1, except that a, = 0 or 1. We may also use the identity (ST)* = —1 
to achieve a further simplification in some cases. 

We now return to the fundamental domain F for T = SL,(Z). Recall that 
in Chapter I §4 we also had a fundamental domain, in that case a parallel- 
ogram II c C for the lattice L. In that case the group was L, the action of 
g¢Lona point zéC was simply g(z) = g + z. Every zeC is L-equivalent to 
a point in IT, and no two points in the interior of I are L-equivalent. In that 
situation we found it useful to glue together the boundary of I by identifying 
L-equivalent points. We obtained a torus, and then we found that the map 
z++(@(z), '(z)) gives an analytic isomorphism from the torus C/L to the 
elliptic curve y? = 4x? — g,(L)x — g3(L) (see §1.6). 

In our present situation, with the group I acting on the set H with funda- 
mental domain F, it is also useful to identify T-equivalent points on the 
boundary of F. Visually, we fold F around the imaginary axis, gluing the 
point 4+ iy to —4+ iy and the point e?*” to e"?2- for y > 3 and 
4 <0 <4. The resulting set F with its sides glued is in one-to-one corre- 
spondence with the set of T-equivalence classes in H, which we denote 
I\H. (The notation I'\H rather than H/T is customarily used because the 
group I acts on the set A on the left.) 

In Chapter I we saw how useful it is to ‘““complete”’ the picture by including 
a point or points at infinity. The same is true when we work with I\H. 

Let H denote the set HU {00} U Q. That is, we add to Ha point at infinity 
(which should be visualized far up the positive imaginary axis; for this 
reason we sometimes denote it ioo) and also all of the rational numbers on 
the real axis. These points {00} U Q are called “‘cusps’’. It is easy to see that 
I permutes the cusps transitively. Namely, any fraction a/c in lowest terms 
can be completed to a matrix (¢ 5)eT by solving ad — bc = 1 for d and b; 
this matrix takes oo to a/c. Hence all rational numbers are in the same 
T’-equivalence class as oo. 

If T’ is a subgroup of I, then I’ permutes the cusps, but in general not 
transitively. That is, there is usually more than one I’’-equivalence class 
among the cusps {co} U Q. We shall see examples later. By a “cusp of I” 
we mean a I’’-equivalence class of cusps. We may choose any convenient 
representative of the equivalence class to denote the cusp. Thus, we say that 
“T has a single cusp at 00”’, where oo can be replaced by any rational number 
a/c in this statement. 

We extend the usual topology on H to the set H as follows. First, a funda- 
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Figure III.2 


mental system of open neighborhoods of 00 is Nc = {ze H|Im z > C} u {00} 
for any C > 0. Note that if we map H to the punctured open unit disc by 
sending 


zeeqgne™, (1.8) 


and if we agree to take the point oo € H to the origin under this map, then 
Nc is the inverse image of the open disc of radius e~**“ centered at the origin, 
and we have defined our topology on H u {00} so as to make this map (1.8) 
continuous. 

The change of variables (1.8) from z to g plays a basic role in the theory of 
modular functions. We use (1.8) to define an analytic structure on H u {00}. 
In other words, given a function on H of period 1, we say that it is mero- 
morphic at oo if it can be expressed as a power series in the variable g having 
at most finitely many negative terms, i.e., it has a Fourier expansion of the 
form 

f(z) a yi anerrin® = 2, ang’, (1.9) 
in which a, = 0 for n «<0. We say that f(z) is holomorphic at oo if a, = 0 
for all negative n; and we say that f(z) vanishes at 00 if f(z) is holomorphic 
at oo and a, = 0. More generally, if f(z) has period N, then we use the map 
Zh qn e*™/N to map Hw {co} to the open unit disc. We then express 
f(z) asa series in gy, and say that it is meromorphic (is holomorphic, vanishes) 
at oo if a, = 0 for n << 0 (respectively, for n < 0, forn < 0). 

Next, near a cusp a/ce@ c H we define a fundamental system of open 
neighborhoods by completing a, c to a matrix a = (¢ 5)eT and using « to 
transport the Nc; to discs which are tangent to the real axis at a/c (see Fig. 
III.2). In other words, with this topology, to say that a sequence z; approaches 
a/c means that «~'z, approaches ico, i.e., that Im «~*z, approaches infinity 
in the usual sense. Notice, by the way, that the topology near the rational 
numbers a/c does not agree with the usual topology on the real line, i.e., a 
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sequence of rational numbers which approaches a/c as real numbers will 
not approach a/c in our topology. 

Let F denote the fundamental domain F with T-equivalent boundary 
points identified and with the cusp oo thrown in. Thus, the points of F are 
in one-to-one correspondence with I'-equivalence classes in H. We take the 
topology on F which comes from the topology on H. That is, by a small disc 
around an interior point of F we mean a disc in the usual sense; by a small 
disc around oo we mean all points lying about the line Im z = C, where C is 
large; by a small disc around a boundary point —} + iy we mean the half- 
disc contained in F together with the half-disc of the same radius around 
1 + iy which is contained in F; and so on. Thus, F has an analytic structure 
coming from the usual structure on H, except at oo, where it comes from 
the usual structure at 0 after the change of variable (1.8). 

In Chapter I we found that the Weierstrass -function gave us an analytic 
isomorphism of C/L with an elliptic curve in P2. In our present situation, 
we shall later see that a certain function (called the “j-invariant’’) gives an 
analytic isomorphism between F = I'\H and the projective line (Riemann 
sphere) Pé. 

We now look at fundamental domains for subgroups I’ < I’. Suppose 
that [[: ’] =n < 00, so that I’ can be written as a disjoint union of n 
cosets = |)", wT’. [claim that F’ = |_)?_, a; ’F will serve asa fundamental 
domain for I’. Let us verify that every ze H is I’’-equivalent to a point in 
F’; the verification that no two interior points of F’ are T’’-equivalent will 
be left as an exercise (see below). Let ze H. Since F is a fundamental domain 
for I’, we can find yeT such that yze F. Then for some i we have y = a,’ 
with y’ eI’, and hence y’zea;!F c F’, as desired. Roughly speaking, F’ is 
n times as big as F because there are one n-th as many elements y’ eT’ which 
one can use to move z around. 

There are many possible choices of the «; in the coset decomposition in the 
previous paragraph. In practice, we shall always try to choose the «; so that 
the resulting F’ is simply connected. 

As an example, let us find a fundamental domain F(2) for T’ = T(2). 
(We shall use the notation F(N) to mean any fundamental domain for '(N), 
F)(N) to mean any fundamental domain for I,(4), etc.) Since I'(2) is the 
kernel of the surjective map SL,(Z) > SL,(Z/2Z), and since SL,(Z/2Z) = 
GL,(Z/2Z) is isomorphic to S$; (see Problem 6 of §1.8), it follows that 
[I': 1(2)] = 6. As coset representatives of T modulo I'(2) let us choose: 


106 Il]. Modular Forms 


abe | | | 
3 _4 | rh 0 1 L 
2 3 a 3 2 
Figure III.3. A fundamental domain F(2) for I'(2). 
The resulting fundamental domain F(2) = | J&, 0; 'F is depicted in Fig. 


III.3. Because any fractional linear transformation takes a circle or line to 
a circle or line and preserves symmetry about the real axis, it follows that the 
boundary of any fundamental domain constructed in this way consists of 
vertical lines and arcs of circles centered at rational numbers on the real 
axis. The boundary of the fundamental domain in Fig. IHI.3 is made up from 
the vertical lines Re z = —3 and Rez = 3, the circles of radius | centered 
at 0 and —1, and the circles of radius 4 centered at } and —4. 

We see that I'(2) has three cusps: 00, 0, —1. That is, there are three 
I'(2)-equivalence classes of cusps with those three points as representatives. 
Now the etymology of the word “cusp” is clear: geometrically, F(2) has the 
appearance we usually associate with the word ‘“‘cusp”’ at the points 0 and 
—1. 


PROBLEMS 


1. Prove that (4) is a normal subgroup of Ip(4V) but not of I. Is [o(V) a normal 
subgroup of T-? 
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15. 
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. (a) Prove that for any N, the map SL,(Z) > SL,(Z/NZ) obtained by reducing the 


matrix entries modulo N is a surjective group homomorphism. 

(b) Prove that for any positive integers M and N, the maps (“‘reduction modulo 
N’) from SL,(Z/MNZ) to SL,(Z/NZ) and from GL,(Z/MNZ) to GL,(Z/NZ) 
are surjective group homomorphisms. 


. What is the order of the group (a) GL,(F,)? (b) SL2(F,)? 
. (a) What is the kernel of the homomorphism GL,(Z/p*Z) > GL,(Z/pZ)? 


(b) What is the order of the group GL,(Z/p°Z)? 
(c) What is the order of the group SL,(Z/p‘Z)? 


. Let N = p<: --- pér be the prime factorization of the positive integer NV. Show that 


the reductions modulo pfi, j= 1, ..., 7, give isomorphisms 


GL,(Z/NZ) 3T1,GL,(Z/psiZ) and SL,(Z/NZ) > 01,SL2(Z/peZ). 


. What is the order of the group SL,(Z/NZ)? 
. Find the indices [T: T(N)], (T,(4): F(N)], [Po(N): P(N) ]. [Po(4V): P(N) ], and 


[P: T(N)]. 


. Find a simple isomorphism from (NV) to a subgroup of I')(N) having index ¢(N) 


in T,(N%). In particular, (2) and I'p(4) are isomorphic. 


. Suppose that I’, and I’, are two subgroups of finite index in’, andl, = aI,«7' for 


some «€GL,(Q). If F, is a fundamental domain for T’,, prove that af, is a fun- 
damental domain for T,. 


. Using the previous problem, draw a fundamental domain for I,(4). 


. Suppose that I = (_)"_, a,I”, where I’ is a subgroup of index n in I. Let F’ = 


("-. a; |F. Show that no two distinct points in the interior of F’ are I’-equivalent. 


. Describe all congruence subgroups I’ of level 2, i.e., all groups contained between 


T and ['(2): (2) cl’ cI. Foreach such I’, find a simply connected fundamental 
domain by taking a suitable part of the fundamental domain F(2) for '(2) that was 
given in the text. 


. (a) Prove that +S and T?, with S and 7 as in (1.7), generate one of the subgroups 


of level 2 in the previous problem. That group was denoted (2) by Hecke. 
(b) Prove that TST and T? generate T'°(2) alG °) mod 2}; and that Tand ST’S 
generate I'(2). 
(c) Prove that the elements 7? and ST~?S generate I'(2). 
(d) Prove that T and ST~‘*S generate T'y(4). 


. (a) Prove that the following is a complete set of coset representatives {a,} for 


Io(p’) in T, ie., F = (JaMo(p*) is a disjoint union: 
1; TKS, k=0,1,...,p°—1; STS, k=1,2,...,po=1. 


(b) Use part (a) to draw a fundamental domain for T'9(4). 
(c) Using part (a), describe a fundamental domain for I'9(p). Draw the fundamen- 
tal domain for T,(3). 


Suppose I’, and I’, are subgroups of finite index in I having fundamental domains 
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F, and F;, respectively. Does F; < F, imply that T, > T’,? (Give a proof or 
counterexample.) 


16. (a) The assertion at the end of the section that (2) has 3 cusps (i.e., that there are 
three I'(2)-equivalence classes of cusps) does not immediately follow from the 
appearance of the fundamental domain, because boundary points of the fun- 
damental domain may be I'(2)-equivalent. But prove directly that oo, 0, —1 
are I'(2)-inequivalent to one another. 

(b) How many cusps does each of the congruence subgroups in Problem 12 have? 


17. Let {a;} be a complete set of coset representatives for I’ in T, where I’ is a sub- 
group of finite index in I. Show that the cusps of I’ are among the set {a;' 00}, but 
that oj’ 00 and a,‘ oo are I’’-equivalent if and only if there exists neZ such that 


a, T'a,eT’. 


18. Prove that I'y(p) has two cusps co and 0; and that I(p”) has p + 1 cusps: 00, 0, 
and —I/kpfork=1,...,p—1. - 


§2. Modular forms for SL,(Z) 


Definition. Let f(z) be a meromorphic function on the upper half-plane H, 
and let k be an integer. Suppose that f(z) satisfies the relation 


fz) =(cz + 'f@) forall y= (’ j)eSE@) (2.1) 

In particular, for the elements » = T = (9}) and y = S=(_¢ 9), (2.1) gives 
f+ I) =f; (2.2) 
f(-1/z) = (—2z)‘f@). (2.3) 


Further suppose that f(z) is ‘‘meromorphic at infinity”. Recall that this 
means that the Fourier series 
f(z) = > aq", where g=e?"", (2.4) 
neZ 
has at most finitely many nonzero a, with n <0. Then f(z) is called a 
“modular function of weight k for l = SL,(Z)’. 

If, in addition, f(z) is actually holomorphic on H and at infinity (i.e., a, = 0 
for all n <0), then f(z) is called a “modular form of weight & for T = 
SL,(Z)’. The set of such functions is denoted M,(T). 

If we further have a) = 0, i.e., the modular form vanishes at infinity, then 
JS(2) is called a ‘‘cusp-form of weight k for T’’’. The set of such functions is 
denoted S,(I). (The use of the letter S is traditional, and comes from the 
German “‘Spitzenform”’ for ‘‘cusp-form’’. Cusp-forms are sometimes also 
called “parabolic forms’’.) 
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Finally, the expansion (2.4) for a modular function f(z) is called its 
“‘g-expansion”’. 
We first make some easily verified remarks about these definitions. 


Remarks. 1. If k is odd, there are no nonzero modular functions of weight 
k for. We see this by substituting y = (“3 _9) in (2.1). So in this section we 
shall always suppose that & is even. 

2. Since 42 = £((az + b)/(cz + d)) = (cz + d)~?, we can rewrite (2.1) in 
the form: (42)"?f(yz) = f(z), ie., f(z) (dz)? is invariant when z is replaced 
by yz. From this we see that if (2.1) holds for y, and y,, then it holds for 
712. Since all of I is generated by S and T, this means that (2.2)-(2.3) 
imply (2.1). 

3. The conditions are preserved under addition and scalar multiplication, 
i.e., the sets of modular functions, forms, and cusp-forms of some fixed 
weight are complex vector spaces. In addition, the product of a modular 
function (or form) of weight k, and a modular function (respectively, form) 
of weight k, is a modular function (respectively, form) of weight k, + k.; 
and the quotient of a modular function of weight k, by a nonzero modular 
function of weight k, is a modular function of weight k, — k,. In particular, 
the set of modular functions of weight zero is a field. 

4. If k =0, then the condition (2.1) says that f(z) is invariant under 
IT = SL,(2), i.e., it may be considered as a function on I\H. If k = 2, then 
the differential form f(z)dz on H is invariant under I, i.e., f(yz)dyz = f(z)dz, 
since dyz/dz = (cz + d)”?. 


EXAMPLE (Eisenstein series). Let k be an even integer greater than 2. For 
zéH we define 


Go2y 


‘______ 2.5 
in (mz + nye? 22) 


where the summation is over pairs of integers m, n not both zero. If we let 
L, denote the lattice in C spanned by 1 and z, then this is a familiar definition 
from Chapter I (see (6.3) of Chapter I). That is, the function G,(z) in (2.5) 
is what we then denoted G,(L,). The new point of view in this chapter is to 
think of G,(z) = G,(L,) as functions of z, not merely as coefficients in the 
Laurent expansion of the Weierstrass g-function. (The letter z was used in 
a different way in §6 of Chapter I.) 

Because k is at least 4, the double sum (2.5) is absolutely convergent, and 
uniformly convergent in any compact subset of H. Hence G;,(z) is a holo- 
morphic function on H. It is also obvious that G,(z) = G,(z + 1), and that 
the Fourier expansion (2.4) has no negative terms, because G,(z) approaches 
a finite limit as z > ico: 


lim \’ (mz +n) * = ¥ n* = 2¢(k). 
ea m,n n#0 
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Finally, we easily check that 
z*G,(—1/z) = ¥’ (—m + nz) * = G,(2), 


i.e., G,(z) satisfies (2.3). We have proved 
Proposition 5. 
G,éM,(T). 


We now compute the g-expansion coefficients for G,. We shall find that 
these coefficients are essentially the arithmetic functions of n 


oi) = a. (2.6) 


d|n 


Proposition 6. Let k be an even integer greater than 2, and let ze H. Then the 
modular form G,(z) defined in (2.5) has q-expansion 


2k & " 
G,(z) = 2¢(k) (: Re y %-1")9 ) (2.7) 
kn=1 
where q = e?", and the Bernoulli numbers B, are defined by setting 
x Oi ae 
= YB. 2.8 
e—] » Kk ( ) 


(Note. Our notation is slightly different from Serre’s in A Course in 
Arithmetic. Basically, he uses 2k where we use k.) 


Proor. The logarithmic derivative of the product formula for sine is 


1 = 1 1 
ial H. . 
7m cot(ma) Pe ¥(. — + i -). ae (2.9) 


If we write the left side as zi(e™* + e-™)/(e™ — e7™4) = ni + 2ni/(e?™4 — 1), 

multiply both sides by a, replace 2zia by x, and expand both sides in powers 
of x (using (2.8)), we obtain the well-known formula for ¢(k): 

(2ni)* B, 

2 =k! 


Next, if we successively differentiate both sides of (2.9) with respect to a 
(see Problem 8(d)-—(e) of $1.4) and then replace a by mz, we obtain: 


C(k) = for k > Oeven. (2.10) 


. 1 = (2ni)* ~ k-1 ,2ninmz __ _ 2k k ~ d*} dm 
ome eH”? BO) 209 


(where we have used (2.10) and replaced n by d and e?” by q). Thus, 


AC oat oe ae ee 


m=1 n=— 0 (mz + n)* 


= 2¢(k) (: 2 y atgin)| 


km,d=1 
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Collecting coefficients of a fixed power q” in the last double sum, we obtain 
the sum in (2.6) as the coefficient of g". This completes the proof. a 


Because of Proposition 6, it is useful to define the ‘‘normalized Eisenstein 
series’, obtained by dividing G,(z) by the constant 2¢(k) in (2.7): 


9D; fea) 
E,(z) = oO) Sates re) on-s(7)q". (2.11) 


Thus, £,(z) is defined so as to have rational g-expansion coefficients. The 
first few E, are: 


E,(z) = 1+ 240 ¥° o3(n)q"; 
n=1 

E,(z) = 1 — 504 }' a5(n)q"; 
n=1 

E,(z) = 1+ 480 ¥° a,(n)q"; 
n=1 


E,.(z) = 1 — 264 » o4(n)q" ; 
n=1 


65520 & : 
E,,(z)=1+—— » 01 1(1)q"; 
691 


E,4(z) = 1 — 24 > o13(")q". 
n=1 


An alternate way of defining the normalized Eisenstein series is to sum only 
over relatively prime pairs m, n in (2.5): 


1 1 
E,(z) = 
(2) 2 en Oe EE (2.12) 
(m,n)=1 


where (m, n) denotes the greatest common divisor. The equivalence of (2.12) 
and (2.11) will be left as an exercise (see below). 


EXAMPLE (The discriminant modular form A(z)). Recall from our study of 
the Weierstrass g-function of a lattice L that one defines g,(L) = 60G,(L), 
g3(L) = 140G,(ZL) to be the coefficients occurring in the differential equation 
satisfied by the g-function (i.e., in the equation of the elliptic curve corre- 
sponding to L; see (6.8)—(6.9) in §1.6). Now let us define g,(z) = g,(L.), 
g3(z) = g3(L,) for ze H. That is, 


gx(z) = 60G,(z); xz) = 140G,(z). (2.13) 


Then g,(z) and g3(z) are modular forms for I’ of weight 4 and 6, respectively. 
Since €(4) = 2*/90 and ¢(6) = 2°/945, we can express g, and g, in terms of 
the normalized Eisenstein series E, and FE, as follows: 
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g2(z) = 3m*E4(z);— 93(z) = 240° E,(z). (2.14) 


The discriminant of the elliptic curve corresponding to L,—a function 
of the lattice which is nonvanishing for all nontrivial lattices, i.e., in all cases 
when the cubic polynomial 4x* — g,(L,)x — g3(L,) has distinct roots—is 
given by (see Problem 2 of §1.6) 


A(z) = ga(2)? ~ 27a? = P(E - F@). —@.13) 

By Remark 3 following the definition of a modular form, we see that A(z) 
is a modular form of weight 12 for T. Moreover, because both £,(z) and 
E,(z) have constant term a) = 1 in their g-expansions, we see that the 
constant term in (2.15) is zero, i.e., A(z) is a cusp-form. It is the first example 
of a cusp-form that we have seen. We shall later see that it is the cusp-form 
of lowest possible weight for I. 

The qg-expansion of (2m)~!*A(z) has rational coefficients, and the first 
coefficient is easily computed to be 1 (use: FE, =1+240¢+---, Eg = 
1 — 504q + ---). We shall later prove a remarkable product formula, due 
to Jacobi, for this discriminant function. 


The example of Eisenstein series gives us one modular form for every even 
weight starting with 4. It might seem unfortunate to have to pass up 2. Is 
there any Eisenstein series that can be salvaged in the case k = 2? 

It turns out that there is, but the normalized Eisenstein series FE, is not 
a modular form. We use the same definition as for the other F,, except that 
the double sum when & = 2 is not absolutely convergent, and we must take 
care with the order of summation. Note that in the proof of Proposition 5, 
when showing that z~*G,(—1/z) = G,(z), we needed to reverse the order of 
summation over m and n. Because this change of order of summation is no 
longer justified when we only have conditional convergence, it turns out 
that E, fails to transform “correctly” under z+> — 1/z. E, satisfies a slightly 
more complicated transformation rule, a rule that will be used in important 
ways later. 

Thus, we define (here ’ means that n 4 0 if m = 0) 


i. = 1 
Ee) LD woe es (mz +n)? 
=1+ = 3 : (2.16) 


TK” m#0 7 +n)’ 


6 ce) 00 
bt ae » +o? (mz + a 

The inner sums clearly converge for any z¢ H and any m; and, once we 
obtain a different expression for the inner sums, we shall see that the outer 
sum over m then converges absolutely. As in the proof of Proposition 6, 
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we find that for any fixed m = 1, 2, 3, ... 


@ 1 4,. 2 ; 
=— 2 dq@™, where gq =e?™. 
De Pye Be Rh roe 


This gives us 


E,=1-24¥ Y¥ dq™. 
m=1d=1 
Since |q| < 1, it is easy to see that the double sum over m and d is absolutely 
convergent. Collecting coefficients of g” by summing over all divisors d of n, 
we obtain: 


E,=1-—24¥ o,(n)q". (2.17) 
n=1 


As in the case of the higher E,, E, is a holomorphic function on H which 
is periodic of period 1 and holomorphic at infinity. In order for it to be a 
modular form of weight 2 for IT’, all we would need in addition is for 
z ?E,(—1/z) to equal E,(z). From (2.16) we find that 


= = x, 1 
Z °E(- 1/z) = ROE Eat ta ’ i 


2 ome (mz : ne 
Thus, the extent to which £, fails to satisfy the “right” rule is a reflection 
of the alteration produced by reversing the order of summation. We now 
compute this ‘“‘error term’’. 


Proposition 7. 


Ae is Be: a (2.19) 


ProorF. The proposition says that 12/27iz is the difference between the double 
sum (2.18) and the double sum (2.16). Suppose we introduce a “‘correction 
term”’ a,,,, inside both double sums which causes the double sums to be 
absolutely convergent. In that case, the order of summation of the “‘cor- 
rected’ double sums does not make any difference, i.e., the two ‘“‘corrected”’ 
double sums are equal. It then follows that the difference between (2.18) 
and (2.16) is equal to the difference between Z,,2,,@_,,, aNd LX, Ly Imp The 
idea is to choose a,, , to be a term close to (mz + ny? ‘but which is easier to 
sum. 
Let us take 


1 1 1 


Frm.n = Ary n(Z) = (mz +n — 1)(mz +n) a (mz+n—1) (mz +n)’ 


114 III. Modular Forms 


Since the difference between (mz +n)? and a,,, is 1/(mz + n)?(mz +n — 1), 
it follows that the following “corrected” E, is absolutely convergent : 


Biy=14+ 50 > (os ama?) 


neon es KUNZ ny 


ee ee eee 


2 
Te meon2nao(MZ+N) Mia sontco\mztn mztn— 


But since the last inner sum telescopes to zero, we have: E,(z) = E,(z). 
Because the double sum for £, is absolutely convergent, we have 


3 


E,(z)=1+-5 - > (ote am s(2)) 


Te ne mz +n) 
: ( ) (2.20) 
a2 E;(—1j2) = oy >; Qm,n(Z)- 
qT n=— 0 m#0 


So it remains to evaluate this last double sum. 
Now this double sum differs from the sum in (2.18) by an absolutely 
convergent series. As in the derivation of (2.17), we find that for n > 0 


1 1 1 1 
2 


= = 4n? x de~2nidn/z 
meo(mz—n)? 2? mao (—n/z + m)? n a iA : 
Since —1/z is a fixed element of H, we see that the outer sum over 7 in (2.18) 
converges absolutely. Thus, the same is true for ©, 2, @,,,(Z). This justifies 
writing 

oo N 

dd Gmal2)= lim SY Gn a(Z) 

n=~ 20 m#0 N> oO p= —N+1m#0 


N 


= tn 3 aa 5@): 


N> © m#0n=—-N+1 


The last inner sum telescopes to I/(mz — N) — 1/(mz + N), and by (2.9) 
we have 


l l 2< 1 1 
= + 
S(ty a1 a 2 (aya ain] 


We conclude that the double sum is equal to 


2n ,. Qn, .e 77iN/F 4] 2ni 
=— lim cot(—2N/z) = lim i : -_ : 
2 Wie ( / ) Z N-w e) 2iN/z | Zz 


Substituting this into (2.20) gives: 


12 
2niz 


E,(2) = 2°7E,(— 12) + = 2 7E,(— 1/2) - 


This completes the proof of Proposition 7. o 
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L ms oa 


-1/2 0 1/2 


Figure III.4. Contour for the proof of Proposition 8. 


The next result will play a basic role in determining the spaces M,(I-) and 
S,(7) of modular forms and cusp-forms of given weight for IF, and it will 
also be useful in proving that two modular forms defined in different ways 
are actually the same in certain cases. 


Proposition 8. Let f(z) be a nonzero modular function of weight k for T. 
For Pe H, let vp(f) denote the order of zero (or minus the order of pole) 
of f(z) at the point P. Let v,,(f) denote the index of the first nonvanishing 
term in the q-expansion of f(z). Then 


rN F GUN + z+ Tvl =S. 20) 


pee 
3 Pel\H.P#i.o 


(Note. It is easy to check that vp(/') does not change if P is replaced by 
yP for yeT.) 


Proor. The idea of the proof is to count the zeros and poles in I\H by 
integrating the logarithmic derivative of f(z) around the boundary of the 
fundamental domain F. More precisely, let C be the contour in Fig. III.4. 
The top of C is a horizontal line from H = 4+ iTto A = —4 + iT, where T 
is taken larger than the imaginary part of any of the zeros or poles of f(z). 
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(Note. That this can be done, i.e., that f(z) does not have poles or zeros 
with arbitrarily large imaginary part, follows from the fact that the change 
of variables g = e2"* makes f(z) into a meromorphic function of g in a disc 
around g = 0.) The rest of the contour follows around the boundary of F, 
except that it detours around any zero or pole on the boundary along circular 
arcs of small radius ¢. This is done in such a way as to include every I- 
equivalence class of zero or pole exactly once inside C, except that i and w 
(and Sw = —@) are kept outside of C if they are zeros or poles. In Fig. III.4 
we have illustrated the case when the zeros and poles on the boundary of F 
consist of i, w (and hence also Sw), one point P on the vertical boundary 
line (and hence also the T-equivalent point on the opposite line), and one 
point Q on the unit circle part of the boundary (and hence also SQ). 
According to the residue theorem, we have 


fi [feu Fah 2.22 
Cc 


. Zz as 
2Qni | f(z) PeD\H, P#i,0 


On the other hand, we evaluate the integral in (2.22) section by section. 
First of all, the integral from A to B (see Fig. III.4) cancels the integral 
from G to H, because f(z + 1) = f(z), and the lines go in opposite directions. 
Next, we evaluate the integral over HA. To do this we make the change 
of variables g = e?"*. Let f(g) =f(z) = Za,q" be the q-expansion. Since 
LO=a. f(q)44, we find that this section of the integral in (2.22) is equal to 
the following integral over the circle of radius e **" centered at zero: 

sake | df/dq d 


2ni 


fig)" 

Since the circle is traversed in a clockwise direction as z goes from H to A, 
it follows that this integral is minus the order of zero or pole of f(q) at 9, 
and this is what we mean by —v,,(/). 

To evaluate the integral over the arcs BC, DE, and FG, recall the derivation 
of the residue formula. If f(z) has Laurent expansion c,,(z — a)" + --- near 
a, with c,, #0, then f(z)/f(z) = 27 + g(z), with g(z) holomorphic at a. 
If we integrate /’(z)/f(z) counterclockwise around a circular arc of angle 
centered at a with small radius ¢, then as ¢ > 0 this integral approaches mi@ 
(the usual residue formula results when 6 = 27). We apply this to the section 
of (2.22) between B and C, letting ¢ > 0. The angle approaches 1/3, and so 
we obtain —31;(v,,(/)iz/3) = —v.(f)/6. (The minus sign is because the 
arc BC goes clockwise.) In the same way, we find that as ¢ > 0 the part of 
(2.22) from D to E becomes — 0;,(f)/2, and the part between F and G becomes 
—v_g(f/6 = —v4(f)/6. 

What remains is the integral from C to D and from E to F. Combining 
the above calculations, we find from (2.22) that the left side of (2.21) is 
equal to that remaining section of the left side of (2.22). Thus, to prove 
Proposition 8, it remains to show that in the limit as ¢ > 0 
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2ni J op fa)” * Oni he fa“ 3 
To compute the sum of these two integrals, we note that the transformation 
S: zt» —1/z takes CD to EF, or more precisely, to FE, i.e., Sz goes from F 
to E along the contour as z goes from C to D along the contour. The desired 
formula (2.23) will follow from the following more general lemma. 


Lf LO e250 ¢ $OR 2 co 


Lemma. Let y= (2 ‘jel, with c #0, and let f(z) be a meromorphic 
function on H with no zeros or poles on a contour C < H. Suppose that 
S(yz) = (cz + d)*f(z). Then 


fO,-{ £@, =—+| dz__ 2.24 
fo” lt Z coz t+ de) (2.24) 


The required equality (2.23) follows immediately from the lemma, where 
we set y = S=({ ~9), note that S(CD) = FE, and compute that as e > 0 


1/4 . 
ay ae | do = at (where z = e?7°), 
2ni a ia 12 


PROOF OF THE LEMMA. Differentiating 
f(yz) = (cz + a)'f(2), (2.25) 
we obtain 


ron = (cz + d)'f'(z) + ke(cz + dy f(z). (2.26) 


We now divide (2.26) by (2.25): 


AC ee AO 

fyz) 1 ~ fey" tesa 
Thus, the left side of (2.24) is equal to 

LQ LW) gp, = =-«| cde 

fey” ~ “fgz) oe +d 


This completes the proof of the lemma, and of Proposition 8. Oo 


We now derive several very useful consequences of Proposition 8. 


Proposition 9. Let k be an even integer, T = SL,(Z). 


(a) The only modular forms of weight 0 for T are constants, i.e., My(T) = 

(b) M,(T) = 0 ifk is negative or k = 2. 

(c) M,(1) is one-dimensional, generated by E,, ifk = 4, 6, 8, 10 or 14; in other 
words, M,(T) = CE, for those values of k. 

(d) S,7) = 0 ifk < 12 or k = 14; S,,(1) = CA; and for k > 14 S(T) = 
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AM,_,,(1) (i.e., the cusp forms of weight k are obtained by multiplying 
modular forms of weight k — 12 by the function A(z)). 
(e) M,(T) = S,(1) ® CE, for k > 2. 


Proor. Note that for a modular form all terms on the left in (2.21) are 
nonnegative. 


(a) Let fe M,(T), and let c be any value taken by /(z). Then f(z) — ce M,(T) 
has a zero, i.e., one of the terms on the left in (2.21) is strictly positive. 
Since the right side is 0, this can only happen if f(z) — c is the zero 
function. 

(b) If k <0 ork = 2, there is no way that the sum of nonnegative terms on 
the left in (2.21) could equal 4/12. 

(c) When k = 4, 6, 8, 10, or 14 we note that there is only one possible way 

of choosing the vp(/) so that (2.21) holds: 

for k = 4, we must have v,,(f) = 1, all other vp(/) = 0; 

for k = 6, we must have 0;( f) = 1, all other vp(/) = 0; 

for k = 8, we must have v,,(/) = 2, all other up(/) = 0; 

for k = 10, we must have v,(f) = 0;(f) = 1, all other vp(/) = 0; 
for k = 14, we must have v,(f) = 2, v)(f) = 1, all other vp(f) = 0. 

Let f,(z), f(z) be nonzero elements of M,(I). Since f,(z) and f,(z) have 

the same zeros, the weight zero modular function /, (z)/f,(z) is actually 

a modular form. By part (a), /; and f, are proportional. Choosing 

f2(z) = E,(z) gives part (c). 

For fe S,(T) we have v,,(/) > 0, and all other terms on the left in (2.21) 

are nonnegative. Notice that when k = 12 and f= A, (2.21) implies that 

the only zero of A(z) is at infinity. Hence, for any k and any fe S,(T), 

the modular function //A is actually a modular form, i.e., //Ae M,_,(T). 

This gives us all of the assertions in part (d). 

Since E, does not vanish at infinity, given fe M,(I) we can always 

subtract a suitable multiple of E, so that the resulting f— cE,¢M,(T) 

vanishes at infinity, i.e., f— cE, €S,(I). Oo 


(d 


~~ 


(e 


wm 


We now prove that any modular form for I is a polynomial in £4, E, 
(see Problem 5 of §1.6 for a different proof of this fact). 


Proposition 10. Any fe M,(T) can be written in the form 
fe) = oa Cea @y Eee). (2.27) 


4i+ 6j=k 


PRooF. We use induction on k. For k = 4, 6, 8, 10, 14 we note that E,, Eg, 
E2, E,£,, EZE., respectively, is an element of M,(I), and so, by Proposition 
9(c), must span M,(T). Now suppose that k = 12 or k > 14. It is clearly 
possible to find i and j such that 4i + 6j = k, in which case E, Ee M,(T). 
Given fe M,(I), by the same argument as in the proof of Proposition 9(e) 
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we can find ceC such that f— cE, EieS,(1). By part (d) of Proposition 9, 
we can write fin the form 


(2n)*? 


= cEi EF} =cEi Ei 
f= cE, Eg + Af, = cE, Eg + 1728 


(Ei — Ee) fy, 


where f, €M,_,,(I). Applying (2.27) to f, by the induction assumption 
(with k replaced by k — 12), we obtain the desired polynomial for /. Oo 


The j-invariant. We now define a very important modular function of weight 
zero: 


172892(2)° _ 44 A 


A(z) Roe one: CY CIe Py) C2) 
4 6 


IQs 


Proposition 11. The function j gives a bijection from T\H (the fundamental 
domain with Y-equivalent sides identified and the point at infinity included) 
and the Riemann sphere Pi = C vu {oo}. 


ProoF. In the proof of Proposition 9(d) we saw that A(z) has a simple zero 
at infinity and no other zero. Since g, does not vanish at infinity, this means 
that j(z) has a simple pole at infinity and is holomorphic on H. For any 
céC the modular form 1728g3 — cAeM,,(I) must vanish at exactly one 
point PeI\H, because when k = 12 exactly one of the terms on the left 
in (2.21) is strictly positive. Dividing by A, we see that this means that 
J(z) — c = 0 for exactly one value of zeI\H. Thus, j takes 00 to oo and on 
I\Z is a bijection with C. oO 


Proposition 12. The modular functions of weight zero for T are precisely the 
rational functions of j. 


Proor. A rational function of j(z) is a modular function of weight zero 
(see Remark 3 at the beginning of this section). Conversely, suppose that 
f(z) is a modular function of weight zero for I. If z; are the poles of f(z) 
in I’\H, counted with multiplicity, then f(z) - I, (j(z) — j(z,)) is a modular 
function of weight 0 with no poles in H, and it suffices to show that such a 
function is a rational function of j. So, without loss of generality we may 
assume that f(z) has no poles in H. We can next multiply by a suitable power 
of A to cancel the pole of f(z) at oo. Thus, for some k we will have A(z)*f(z) € 
M,»,(1). By Proposition 10, we can write f(z) as a linear combination of 
modular functions of the form E}E4/A* (where 4i + 6j = 12k), so it suffices 
to show that such a modular function is a rational expression in j. Since 
4i + 6j is divisible by 12, we must have i = 3/9 divisible by 3 and j = 2j, 
divisible by 2. But it is easy to check that F3/A and E2/A are each of the 
form aj + b, by (2.28); and Ez'eE2/0/A* is a product of such factors. This 
proves the proposition. oO 
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The definition (2.28) is not the only way one could have defined the 
invariant. It is not hard to see that any ratio of two non-proportional modular 
forms of weight 12 would have satisfied Propositions 11 and 12. But j(z) 
has the additional convenient properties that: its pole is at infinity, ie., 
it is holomorphic on H; and its residue at the pole is 1, as we easily compute 
from (2.28) that the g-expansion of j starts out} + ---. 

Recall that in Chapter I we used the Weierstrass go-function and its 
derivative to identify the analytic manifold C/Z with an elliptic curve in 
PZ. Similarly, in our present context we have used the j-invariant to identify 
I\H as an analytic manifold with the Riemann sphere P2. Proposition 12 
then amounts to saying that the only meromorphic functions on the Riemann 
sphere are the rational functions. Thus, Proposition 12 is analogous to 
Proposition 8 in §1.5, which characterized the field of elliptic functions as 
the rational functions of x = g(z), y = g'(z). 


A loose end from Chapter I. In Chapter I we defined an elliptic curve over C 
to be a curve given by an equation of the form y” = f(x), where f(x) is a 
cubic with distinct roots. We then worked with curves whose equations were 
written in the form 


y? = 4x? — go(L)x — g3(L) (2.29) 


for some lattice L. It is not hard to see that a linear change of variables will 
bring an equation y* = f(x), where f(x) has distinct roots, into the form 


y?=4x3~Ax—B with A*—27B* £0. (2.30) 


But in order to write such a curve in the form (2.29) we need to show that a 
lattice L can always be found such that g,(L) = A, g3(L) = B. This was not 
proved in Chapter I, but it is easy to prove now using modular forms. 

In what follows we shall write a lattice L = {mw, +nw,}, where z= 
@,/@,€H,as follows: L = AL, = {mdz + nd}. Here 7, = w,. Thus, any lattice 
L is a complex multiple of (i.e., a rotation plus expansion of) a lattice of 
the form L,. 


Proposition 13. For any A, BeC such that A? # 27B? there exists L=AL, 
such that 


g2(L) = A; (2.31) 
g3(L) = B. (2.32) 


ProoF. It follows immediately from the definition of g, that g,(AAL,) = 
4°*g,(L,), and similarly g,(AL,) = 4° °g3(L,). 

By (2.14), we can restate the proposition: there exist 2 and z such that 
E,(2) = 14(3/4n*)A and E,(z) = 4(27/8n°)B. Let a = 34/4n*, b = 27B/8n°. 
Then the condition A? 4 27B? becomes a? # b?. Suppose we find a value 
of z such that E,(z)?/E,(z)* = 6?/a>. Then choose 1 so that E,(z) = A‘a, 
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in which case we must have 
Ecgy =O L3G) [a =a" b, ie. Belay 24d. 


If we have + in the last equation, then our values of z and 4 have the required 
properties (2.31)—(2.32). If we have —, then we need only replace / by id. 
So it remains to find a value of z which gives the right ratio E,(z)?/E,(z)°. 
Now E2/E3 = 1 — 1728/j, by (2.28). Since j(z) takes all finite values on H, 
and b?/a? # 1, we can find a value of z such that b?/a? = 1 — 1728/j(z) = 
E,(z)?/E,(z)*. This completes the proof. Oo 


Thus, there was no loss of generality in Chapter I in taking our elliptic 
curves to be in the form (2.29) for some lattice L. 


The Dedekind eta-function and the product formula for A(z). We conclude 
this section by proving the functional equation for the Dedekind y-function, 
from which Jacobi’s product formula for A(z) will follow. 

The function n(z), zé H, is defined by the product 


n(Z) ne e2niz/24 I] el = e2rinzy (2.33) 
n=1 


(In Problem 7 of §II.4 we gave another definition, and another proof of the 
functional equation. In the problems below we shall see the equivalence of 
the two definitions.) 


Proposition 14. Lez age denote the branch of the square root having nonnegative 
real part. Then 


n(—1/z) = /z/in(). (2.34) 


Proor. The product (2.33) clearly converges to a nonzero value for any 
zé H, and defines a holomorphic function on H. Suppose we show that the 
logarithmic derivatives of the left and right sides of (2.34) are equal. Then 
(2.34) must hold up to a multiplicative constant; but substituting z =i 
shows that the constant must be 1. 
Now the logarithmic derivative of (2.33) is 
n(Z) _ 2ni ner 
n(z) 24 ( 24 ym zm): 


If we expand each term in the sum as a geometric series in g” (q = e?"”), 
and then collect terms with a given power of g, we find that 


WB-W(-aEeme)-Hao. an 


Meanwhile, the logarithmic derivative of the relation (2.34) that we 
want is: 


122 Il. Modular Forms 


n(—1/2),-2_1 ,"@® 
a ae ae (2.36) 


Using (2.35), we reduce (2.36) to showing that 


E,(—1/2)27? = 1 + E,(2), 
2niz 


and this is precisely Proposition 7. oO 
Proposition 15. 


(nya) =q][U—99%4, gs 0t®, (2.37) 
n=1 


Proor. Let f(z) be the product on the right of (2.37). The function f is 
holomorphic on H, periodic of period 1, and vanishes at infinity. Moreover, 
f(z) is the 24-th power of n(z), by definition ; and, raising both sides of (2.34) 
to the 24-th power, we find that f(—1/z) = z’?f(z). Thus, f(z) is a cusp form 
of weight 12 for ’. By Proposition 9(d), f(z) must be a constant multiple of 
A(z). Comparing the coefficient of q in their g-expansions, we conclude that 
f(z) equals (27) *?A(z). oO 


Notice the role of the Eisenstein series E,(z) in proving the functional 
equation for n(z), and then (2.37). The n-function turns up quite often in 
the study of modular forms. Some useful examples of modular forms, 
especially for congruence subgroups, can be built up from n(z) and functions 
of the form 4(Mz). 

The g-expansion in (2.37) is one of the famous series in number theory. Its 
coefficients are denoted 1(n) and called the Ramanujan function of n, since 
it was Ramanujan who proved or conjectured many of their properties: 

» t(n)q" 4 i (beghe, 

Among the properties which will be shown later: (1) t(7) is multiplicative 
(t(nm) = t(n)t(m) if n and m are relatively prime); (2) t(”) = 0, ,(”) (mod 
691) (see Problem 4 below); (3) t(m)/n° is bounded. Ramanujan conjectured 
a stronger bound than (3), namely: |t(7)| <n'*/o9(n) (where oo(n) is the 
number of divisors of n). The Ramanujan conjecture was finally proved ten 
years ago by Deligne as a consequence of his proof of the Weil conjectures. 
For more discussion of t(n) and references for the proofs, see, for example, 
[Serre 1977] and [Katz 1976a]. 


PROBLEMS 


1. Prove that fork>4: E,(z)=4 Yo (mz+n)y*. 


mneZ 
(m,n)=1 


§2. 


10. 
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_ What is E,(i)? 
. (a) Show that E? = Ey, E,E, = E,9, and E, Ey = Ey,. 


(b) Derive relations expressing o; in terms of 03; 0, in terms of 3; and o;; and 0,3 
in terms of 0, and o>. 


_ (a) Show that E,, — E2 = cA, with c = (2n)7!2-2° 35. 7?/691. 


(b) Derive an expression for t(”) in terms of o,, and os. 
(c) Prove that t(n) = o,,(”) (mod 691). 


. Prove that E, and E, are algebraically independent; in particular, the polynomial 
in Proposition 10 is unique. 
ino) nex” ie] 
. (a) Prove the identity 1 =e Y a,(n)x". 
n=1 4 n=1 


(b) Let a = 1 (mod 4) bean integer greater than 1. Show that £,,, (7) = 0, and prove 
the following sequence of summation formulas: 

1/504, a=5; 

1/264, a=9; 

1/24, a=13, etc. 


n‘ 1 
= B = 
e2m™ _ | a+ 1) a+1 


iMs 


. Let f(z) be a modular form of weight k for I. Let 


92) ==") - Rose). 


Prove that g(z) is a modular form of weight k + 2 for T, and that it is a cusp form 
if and only if f(z) is a cusp form. 


. (a) Prove that E, = E,E, — 33; Ej and E, = E,£, — 7;EG. 


(b) Derive relations expressing o, in terms of o, and ¢3, and a; in terms of o, and 
Os. 


. Recall the following definitions and relations from Problems 4 and 7 in §II.4. Let 


x be a nontrivial even primitive Dirichlet character mod N. Define 


O4,0= Y xe, Ret >0. 


n=1 


Then 


Ay, 1) = (N71) "7G (O, 1/N*1). 


Now let x be the character mod 12 such that y(+1) = 1, x(+5) = —1, and define 

A(z) = O(y, —iz/12) for ze H. Then 4(—1/z) = ./z/iq(z). 

(a) Prove that #(z + 1) = e?"!?4A(z), and that #7* eS, (I). 

(b) Prove that #(z) = n(z). 

(c) Write the equality in part (b) as an identity between formal power series in q. 
(Note. This identity is essentially Euler’s “‘pentagonal number theorem.” For a 
discussion of its combinatoric meaning and two more proofs of the identity, 
see [Andrews 1976, Corollaries 1.7 and 2.9].) 


Find the j-invariant of the elliptic curve in Problem 3(b) of §1.2, which comes from 
the generalized congruent number problem. What is / in the classical case 1 = 1? 
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Prove that there is no other value of 4¢Q for which the corresponding j is an 
integer. It is known that an elliptic curve with nonintegral j cannot have complex 
multiplication. 
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Let y= (2 >)eT = SL,(2), let f(z) be a function on H = HU Qv {co} with 
values in C U {co}, and let ke Z. We introduce the notation f|[7], to denote 
the function whose value at z is (cz + d)*f((az + b)/(cz + d)). We denote 


the value of f|[y], at z by f(2)|[y],: 
b 
fOlbhkRCzZ+O"fO2) for y= (? et (3.1) 


More generally, let GL3(Q) denote the subgroup of GL,(Q) consisting 
of matrices with positive determinant. Then we define 


b 
A) Lrla aq (det 1)'*(cz + V2) for r=(? 1 )eGLAO), 
(3.2) 


For example, if y = (§ 2) is a scalar matrix, we have f|[y], =f unless a < 0 
and k is odd, in which case f |[y], = —f. 

Care should be taken with this notation. For example, by definition 
f(2z)\[y], means f|[y], evaluated at 2z, ie., (2cz + d)*f((2az + b)/ 
(2cz + d)). This is not the same as g(z)|[y], for g the function defined by 
g(z) = f(2z); namely, g(z)|[y], = (cz + 4) *g(yz) = (cz + dy *f(2(az + b)/ 
(cz + d)). 

With this notation, any modular function of weight k for I’ satisfies 
f\Ly). =f for all yeT. Some functions are invariant under [y], for other 
ye GL} (Q). For example, recall the theta-function defined in §4 of Chapter 
Il: 0(t) =X,.7e°™” for Ret > 0. We saw that it satisfies the functional 
equation 0(t) = t~'/?6(1/t) (where ./ is the branch such that ,/1 = 1). We 
define a function of zeé H, also called the theta-function, by setting O(z) = 
6(—2iz), i.e., 


O@= Yer = Yq” for zeH, g=e**. 3.3) 


neZz neZ 


Substituting —2iz for ¢ in the functional equation for 0, we have 
O(z) = (22/i) 4? O(— 1/42). (3.4) 


Squaring both sides and using the notation (3.2), we can write 


©7|[y], = —i0? for r=(j ~ J eGL@. (3.5) 


§3. Modular forms for congruence subgroups 125 


Notice that 


f\ly Yolk = (Ff |Lyd0 [Dro] for 71, 72€GLz(Q). (3.6) 


To see this, write (det y)*?(cz +d) in the form (dyz/dz)¥?, and use the 
chain rule. 

We are now ready to define modular functions, modular forms, and cusp- 
forms for a congruence subgroup I’ cT. Let gy denote e277", 


Definition. Let f(z) be a meromorphic function on H, and let T’ cT bea 
congruence subgroup of level N, i.e., T’ > P(N). Let keEZ. We call f(z) a 
modular function of weight k for I’ if 


Sly =f forall yer, (3.7) 
and if, for any y)>€T = SL,(Z2), 
S(2\Lyo], has the form Yagi with a,=0 for n<0. (3.8) 


We call such an f(z) a modular form of weight k for I’ if it is holomorphic 
on H and if for all yyeET we have a, = 0 for all n < 0 in (3.8). We call a 
modular form a cusp-form if in addition a) = 0 in (3.8) for all yy eT. 

Thus, as in the case ’’ = T treated in §2, a modular “function” is allowed 
to have poles of finite order, a “form”? must be holomorphic at all points 
including the cusps, and a “‘cusp-form’’ must vanish at all cusps. This 
interpretation of (3.8) as a condition ‘‘at the cusps”’ will be explained below. 

The first condition (3.7) is the obvious analog of the first condition (2.1) 
for modular functions for . The second condition (3.8) is called ‘““meromor- 
phicity”’ at the cusps (““holomorphicity” if a, = 0 for n <0, “‘vanishing” 
if a, = 0 for n < 0). We now explain this further. 

Let g =f |[yo}, for some fixed »)¢GL}(Q). If f is invariant under I’, 

Le., if f lly) =f for yeI’, then it follows from (3.6) that g is invariant 
under the group yo'I’yo: for all yo*y7yo€y0'F'’y9 we have g|[yo!yvolk = 
(FS |Drodd Lo vv0lk =f [Lovo = Fr] 0|Dodk =f |Lrolk = g- In particular, 
if yoeT and I’ >T(N), then yo'T’yo also contains T(N) (since I'(N) is 
normal in I), and so g = f|[y], is invariant under P(N). Because T™ = 
6 EID (N), we have g(z + N) =g(z), and so g =f|[y], has a Fourier 
series expansion in powers of gy = e?"7". The content of condition (3.8) 
is that this expansion has only finitely many negative powers of gy (no 
negative powers for holomorphicity, only positive powers for vanishing). 

It may happen that g = f|[y], is invariant under a smaller translation 
T", where A|N, i.e., g(z + h) = g(z). In that case the only powers of gy that 
appear in the Fourier series are powers of q, = qv". For example, if yp = J 
and I’ =1,(N), then Tel” and g(z + 1) = g(z). In that case g = f has an 
expansion in powers of g = g, = gy. On the other hand, for T’ = T,(N) 
and yy = S=({ ~6) we have yo'T’79 =I °(N) = {(* 2) mod N}; thus, if 
J is a modular function for P(N), in general we only have g(z + A) = g(z) 
for h = N, where g =f |[S ],. 
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Any cusp sé @ vu {00} can be written in the form s = «too for some a = 
@ + eQ (by writing s = —d/c in lowest terms and finding any solution a 
and b to the equation ad — bc = 1). If we set yy = «* in (3.8), the behavior 
of g=f|[«"'], near oo (i.e., near qy = 0) is a reflection of the behavior 
of fnear s, since g(z) = (—cz + a) “f((dz — b)/(—cz + a)). 


Proposition 16. The condition (3.8) depends only on the Y’-equivalence class 
of s = 7,0. More precisely, ify, 00 = y’y, 00 for some y’ ET’, then the smallest 
power of qy that occurs in the Fourier expansion of f|[y,], and f|[y2], és 
the same. Moreover, if this smallest power is the constant term, then the 
value at qy = 0 is the same for f|[y,], and f\[y2], if k is even; if k is odd, 
this value may at most change sign. 


Proor. If y,0o = y’y,00, then the element y;'y’y,¢I keeps oo fixed, in 
which case it must be of the form + 7/. (Note that (2 3)co = is equivalent 
toc = 0, and the elements of F with c = O are + 7/.) Thus, we have y;'y’y2 = 
+Ti, so that y, = +y7'y, T!. Let g(@) = f@|[k = Danan. Since f|[—T]k 
=(—1)f, and f|[y’"], =f (because »’eI’), it follows that f|[)],= 
(+1*(f|DJOLT 7 = (+ 1)"g|[T"],. Thus, 


f@|\Lr2k = (+ D9 +) = (4D Ya? GN. 


In other words, the gy-expansion coefficients corresponding to y, differ 
from those corresponding to y, only by roots of unity. The proposition now 
follows immediately. Oo 


If f is a meromorphic function on H which is invariant under [y’], for 
y el’, and if seQ U {00} with s = 7.0, yo ET, then we say that fis mero- 
morphic (is holomorphic, vanishes) at the cusp s if f|[yo], has a Fourier 
expansion with only finitely many negative terms (respectively, with no 
negative terms, with no negative terms or constant term). Proposition 16 
says that meromorphicity, holomorphicity, vanishing at s does not depend 
on the choice of y, for which s = y,00, and in fact only depends on the 
T’’-equivalence class of s. 

Thus, the condition (3.8) is really a set of conditions, one corresponding 
to each cusp s of I’. (Recall that ‘‘cusp of I’’”’ means “‘T’’-equivalence class 
of cusps”’.) For example, if I’ = I'o(p) for p a prime, we saw (Problem 18 
in §III.1) that there are only two cusps oo, 0. Thus, the condition (3.8) 
amounts to the two conditions 


f(z) = Ya,4", qg=e", with a,=0 for n<«0; (3.9) 
z*f(-I/z)= bgt, 9, =e", with b6,=0 for n<0O. 
(3.10) 


We call the Fourier series in (3.9) the g-expansion of f at 00, and we call 
(3.10) the g,-expansion of f at the cusp 0. If f is holomorphic, we write 
f(a) for ay and f(0) for 6b). Note that f(0) is not the limit of f(z) as z > 0. 
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We let M,(I’) and S,(I"’) denote the set of modular forms of weight k 
for I’ and the set of cusp-forms of weight k for I’, respectively. As in the 
case I’ = I treated in the last section, it is easy to see that these are C-vector 
spaces, that fe M, (I) and ge M,,(T’) implies fgeM, .,,(’), and that the 
vector space of weight zero modular functions for I’ is a field. Also note 
that if —/eI’, then there are no nonzero modular functions for I’ of odd 
weight k, since then f|[—/], = —f. 

It is immediate from the definition that if ©” < I’, then a modular func- 
tion/modular form/cusp-form for I’ is also a modular function/modular 
form/cusp-form for I”. 

There are more interesting ways to get modular forms for a congruence 
subgroup I” from forms for another subgroup I’. For example, if f(z) = 
La,q"eM,(1), then f(Nz) = La,qN" and (2) = Xa, y(n)q" (the “twist” of 
J by a Dirichlet character y) turn out to be modular forms, although for a 
smaller congruence subgroup than I. The next proposition gives two im- 
portant classes of constructions of this type. In part (b) of the proposition we 
use the notation M,(N, x) with x a Dirichlet character mod N to denote the 
subspace of M,(I°,(N)) (see (1.5)) consisting of f(z) for which /|[y], = x(@) f 
whenever y = (2 %)EI(N). In particular, for y the trivial character 
MON, Xeriv) = M(To(N)). 


Proposition 17. (a) Let T’ be a congruence subgroup of V, let «€GL%(Q), 
and set” =a 'T’aaT. Then T” is a congruence subgroup of T, and the 
map fr>f |[o], takes M,(T’) to M,(0”), and takes S,() to S,(P”). In par- 
ticular, if fe M,() and g(z) = f(Nz), then g€ M,(T(N)) and one has g(«) = 
f(x), g(0) = N-*f(0). 

(b) Let y be a Dirichlet character modulo M, and let y, be a primitive 
Dirichlet character modulo N. If f(z) = X%-94,q"€M,(M, y) and aA 
Zi=0 InX1()q", then f, €M,(MN?, 7x7). If f is a cusp form, then so is fy, In 
particular, if f €M,(I9(M)) and x, is quadratic (i.e., takes values +1), then 
fy, © M,(To(MN?)). 


PROOF. (a) We need two lemmas. 


Lemma 1. Let «€GL}(Q) have integer entries, and let D = det «. fa 
T(N), then «'T’a > T(ND). 


PROOF OF LEMMA 1. Suppose yeI(ND), ie., y= 1+ NDB for some 
2 x 2-matrix f with integer entries and det y = 1. We must show that 
yea 'Ta, ie., that P’ asaya"! = a(1 + NDB)o-! = 1+ NDafa-. But a’ = 
Do"! isan integer matrix. Since det ayo” = det y= land aya! = 1+ NaBa’, 
we have aya 'eI(N) <I’, as claimed. g 


Lemma 2. Suppose that f(z) has the property (3.8) for all y,éT, Le, f(2)|[yo]k 
= Trin, IngN (where no = 0 if f(z) is holomorphic at the cusps, No = 1 if f(z) 
vanishes at the cusps). Then f(z) has the same property for all “2EGL;(Q), 
ie., f(z)|[o], = Lan, Ondnp (for some positive integers a and D which depend 
on a). 
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PROOF OF LEMMA 2. Since « can be multiplied by a positive scalar without 
affecting [«],, without loss of generality we may suppose that « has integer 
entries. It is an easy exercise in linear algebra to show that there exists yp ¢T = 
SL,(Z) such that yo'« = (§ 5), where a and d are positive integers. Then 


flo = S@|L70]0)| I( I. 


= (ad)¥?d-* . a,e2Rinlaz+ bydYN 


n=Ny 


oO 
= (ala)? Y a,e?inehigey 


= Y bina 
n=ang 
where 
a 0 if an; 
no (ajaye2nien a if a|n. 
This proves the lemma. oO 


We now turn to the proof of the proposition. The first assertion in part (c) 
follows from Lemma 1. Now suppose that fe M,(I’’). Then for «7! y’aeT’” 
(where y’eI’) we have (f\[o],)|[e yeh = (| Wilk =S|[ohs in 
addition, for yo €T we have (f|[«],)|[vo]. =| Lovo], and the condition (3.8) 
holds for f|[«],, by Lemma 2. 

Thus, f|[«],¢M@,(I"”). If f vanishes at all of the cusps, then so does f|[«],, 
by Lemma 2. To obtain the last assertion in Proposition 17(a), we write 
a=() {),g=N “*f|[o],, and note that «'Ta0T =I(N). The values 
at the two cusps oo and 0 come from the above formula for 5, with n = 0 
and « replaced by (¥ 9) at the cusp co and by (% °)S = S(j X) at the cusp 0. 
This completes the proof of Proposiicn 17(a). 

(b) Let € = e?”"", and let g = X59 x, (/)& be the Gauss sum. Then 


10-5, 00 ¥ (Ee) at 


Dt iv)z, ye” > Qn e2tin(z— v/N) 


n=0 


=e zit 


aa 1) fz — v/N). 
Now let y= (2 4)€I'9(MN7’). We want to examine f, (yz). Let y, denote 


(4 ~7’*). Then for each v and v’, 0 < v, v’ < N, we compute 


_,__ {@—ev/N b+ (va—vd)/N — evv'|N? 
bey c d+cv/N 
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invariant under [y], for yeT, (NV). It remains to check the holomorphicity 
condition at the cusps. But [y], permutes the G,;"""" for yyéT, by (3.13). 
Hence it suffices to show that each G, is finite at infinity. But 

, 0 ifa, #0; 

lim G,(z) = y m;* = ~ : 


zie mod N,m,=0 xy n if a,= 0. 
n=a,modN 


m= 


The sum £n~* over n = a, mod N converges because k > 3. It is essentially 
a “partial” zeta-function. More precisely, 


GAo"(c0) = C44(k) + (— DAE (K), where KY’ 


n>1 
n=amodN 
(3.14) 
This completes the proof of the proposition. Oo 
As a special case of (3.13), if we set y = —J we have 
G," = (—1)'G;. (3.15) 


This can also be seen directly from the definition (3.12). Thus, for example, 
G, = Oif k is odd and 2a = 0 mod N. 

It is now not hard to construct modular forms for any congruence sub- 
group I’, l > Tl’ >T(N), out of the Eisenstein series Go mod For fixed a, 
the elements yel”’ permute the Eisenstein series G, , where a’ ranges over 
the orbit of a under the action of I’, 


a’ eal’ = Sei 


Let r= #(al’) be the number of sence in the orbit. If F(X,, ..., X,) is 
any homogeneous Syuimeinic polynomial in r variables with total degree d, 
and we set X; = G,’) (where ay; runs through the orbit aI’), then F(G, a 
,G.’’) is easily seen to be a modular form of weight kd for I’. For 
example, taking F to be ¥, + --- + X, or X, ... X,, we have 
Gyo?) ON (z) Ee M,(T'o(N)); 


a, €(Z/NZ)* 


Gl0-42) mod Nizye Many(Po(N))- 


ay €(Z/NZ)* 


(3.16) 


We now compute the qy-expansion for G,. We shall be especially in- 
terested in the cases when a, = 0 or a, = 0. Recall the formula used in 
deriving the g-expansion for G, in the last section (see the proof of Pro- 
position 6): 

1 
Deane ‘2609-4 Le 1 e2tliz for k>2, zed. 
(3.17) 


Let 
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Since g|[T]s = g, and [4/ ], is always trivial, it follows that g is invariant 
under 4«7%, as desired. Oo 


Eisenstein series. Let N be a positive integer. Let a = (a,, a,) be a pair of 
integers modulo N. We shall either consider the a; as elements of Z/N Z or 
as integers in the range 0 < a; < N. Let k be an integer at least 3. Let m= 
(m,, Mz) denote a pair of integers. We shall think of a and m as row vectors. 
For zéH we define the “‘level N Eisenstein series” (corresponding to a and 
k) as follows: 


1 


(mz +m)" (3.12) 


def 


a amod N 
GOOG Oe L 
sana 
If a = (0, 0), we delete the pair m = (0, 0) in the sum (3.12). But there is 
no point in considering the case when a = 0, since, setting m, = Nm, m, = 
Nn, we have 
Ge(z)=N7* YY (mz +n) * = N“*G,(2), 
m,neZ 
which is the Eisenstein series for ! which we already studied in §2. In what 
follows we shall suppose that a # (0, 0). 
Notice that we are allowing k > 3 to be either odd or even. 


Proposition 21. G, Se 


eM,(T(N)), and Gio? 9% € M,(T,(N)). 

Proor. First, the series (3.12) is absolutely and uniformly convergent for 
z in any compact subset of H, because k > 3 (see, for example, Problem 
3 in $1.5). Hence G,(z) is holomorphic in H. Now let y = (@ ‘yeT. Then 


Ge) |[y = (cz + d)* a iz k 
aoe) 


=dmoan (Mya +m,,¢)z + (Mm, b + m,d))*" 


Let m’ =(m,a+m,c,m,b+m),d)=(m, m,)( 2) =my. Note that 
modulo N we have m’ = ay. Let a’ = ay (reduced modulo N). Then the 
maps m+>m' = my, and m’t+>m = my" give a one-to-one correspondence 
between pairs meZ? with m=amod N and pairs meZ? with m= 
a mod N. This means that the last sum above is equal to Lm =a’ mod n(z + 
m;)* = G, (z). Thus, 


Gor N iy], = GP” for yeYl. (3.13) 


If ye (N), then by definition y = J mod N, and so ay = a mod N. Thus, 
(3.13) shows that G, is invariant under [y], for y¢I'(V). Similarly, if ye 
T,(N) and a, = 0, we have (0, a))y = (0, a2) mod N, and so G2) ™4" is 
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cusp-form of weight k for [,(N) must be a multiple of (y(z)n(Nz))*. Here 
n(z) = e7"74 TT, (1 — q"), g = e2*, as in §2. 


Proposition 19. Let f(z) be a nonzero element of S,(To(N)), where N = 2, 
3, 5, or 11 and k = 8, 6, 4, or 2, respectively, so that k(N + 1) = 24. Then 
F(Z) is a constant multiple of g(z) = (n(z)n(Nz))*. 


Proor. By Proposition 17(a), g(z)%*! = A(z)A(Nz) is an element of 
S54(['9(N)). In addition, g(z)**! is nonzero on H, since A(z) #0 on H. 
At infinity, g(z)"*! = q%*! (1 — q")**(1 — q")*4 has a zero of order N + 1 
in its g-expansion. At the cusp zero, we write 


g(2)"**|[S Jog = 2°-24A(— 1/2)A(—N/z) = 27 742!7A(2)(Z/N)!2A (E/N) 
= N~ 9" TT 1 = gh)?4(1 — gh), 


which has a zero of order N + 1 in its gy-expansion. On the other hand, 
since fe S,([o(V)), its g-expansion at oo is divisible by g, and the qy- 
expansion of f|[S ], is divisible by gy. Now (f/g)%*!, as a ratio of two 
elements of S,,(I9(N)), is a modular function of weight zero for ,(N). 
Since g(z) #0 on H, this ratio is holomorphic on H. Moreover, the g- 
expansion of f*! is divisible at least by g¥*, and at zero the gy-expansion 
is divisible at least by gy*!. Hence, the g%*! in g%*! and the g¥*! in 
g” IES] 24 are canceled, and the ratio is holomorphic at the cusps, i.e., 
(4/9)**' € My(To(N)). By Proposition 18, (f/g)**! is a constant, and hence 
Fig is also a constant. This completes the proof. Oo 


Notice that we did not actually prove that g(z) = (y(z)n(Nz))* is in 
S,(Po(NV)), unless we can be assured that there exists a nonzero element 
SE S(T o(N)). The same proof, for example, would tell us that any nonzero 
SE S3(Fo(7)) must be a constant multiple of (4(z)n(7z))?; but S3(I',(7)) = 0, 
since 3 is odd and —/eT,(7). However, for the values of N and & in Pro- 
position 19 it can be shown that dim S,(T'9(N)) = 1 (see Theorem 2.24 and 
Proposition 1.43 in Chapter 2 of [Shimura 1971]). Thus, Proposition 19 
is not vacuous. For N = 2, k = 8 we can see this directly, since we know 
that Tand ST*S = (~} _?) generate I')(2) (see Problem 13(b) of §III.1). 


Proposition 20. (q(z)n(2z))® € Sg(F'o(2)). 


Proor. Clearly, g(z) = (y(z)n(2z))® is holomorphic on H. Its g-expansion 
at oo is: (eres 2* PIT _ gel _ q?"ys mis quad — ga — qn), 
Using the relation 7(—1/z) = /z/in(z), we easily see that g(z) vanishes at 
the cusp 0 as well. It remains to show invariance under [(~} _°)]g. Set 
a=($ ~$). Then (~} _9) = 4a7w, and 


g(z)|[a]s = 2*(2z)”*(n(— 1/2z)n(— 1/z))® = (227) *(,/2z/in(2z),/Z/in(Z))® 
= (n(z)n(2z))* = g(2). 


§3. Modular forms for congruence subgroups 129 


If v’ is chosen for each v so that v’a = vd mod N (such a choice is unique, 
because a and d are prime to N), then we have y,77,' €T')(M), and so 


f,02 =< ty LOS WZ) 


N-1 


le iv) x(a) (cyz + d + ev’ NYPL Qy2) 


= x(d)(ez + as, by Lif — v/N). 


But %1(%) = 4101) %1 @/%s@) = 14 (d)7H1(”’). Thus, 


N-1 
fy,.02) = wad) (cz + ae x MOU — VIN) = wG(d)cz + af, @), 


and so f,, has the right transformation formula to be in M(MN?, x77). 
The cusp conditions are verified by the same method as in the proof of part 
(a) of the proposition. Namely, for allyeT’, f 2) [y], is a linear combination 
of f(z)|Ly,7],, and so the cusp conditions follow from Lemma 2. a 


The next proposition generalizes Proposition 9(a) in the last section. 
Like Proposition 9(a), it is useful in proving equality of two modular forms 
from information about their zeros. 


Proposition 18. M,(I’) = C for any congruence subgroup Y’ <1. That is, 
there are no non-constant modular forms of weight zero. 


Proor. Let fe M,(I’), and let a = f(Zp) for some fixed zy) ¢ H. Let = Ua,’ 
be a disjoint union of cosets, and consider g = N(f [ote der @)1.6y 


g(z) = [[ (f(a *z) — ). (3.11) 


Then g(z) is holomorphic on H, and it satisfies (3.8), because f does. More- 
over, given y'eI we have g|[y ‘Jo = U(f|L(%) "Jo — 2). But since 
f |[a~*]o does not change if « is replaced by another element ay ‘eal’, and 
since left multiplication by y permutes the cosets aI’, it follows that 
{ f|L(ya;)~*]o} is merely a rearrangement of { f|[4; *]o}- “Thus, g|Ly ‘Jo = 9, 
and we conclude that ge M,(I). By Proposition 9(a), g is a constant. Since 
the term in (3.11) corresponding to the coset JT” is f(z) — a, it follows that 
for z = Z, the product (3.11) includes a zero factor. Thus, g = 0. Then one 
of the factors in (3.11) must be the zero function (since the meromorphic 
functions on H form a field). That is, f(«;'z) — a= 0 for some j and for 
all zeH. Replacing z by az, we have: f(z) = a for all ze H, as claimed. O 


As an example of the applications of Proposition 18, we show that for 
any positive integers N and k (with k even) such that kK(N + 1) = 24, a 


§3. Modular forms for congruence subgroups 133 


_ (HVE 12K), bo = be 0 ifa, #0; 
kit NB Ok det | c%2(k) + (— Dkk) fa, = 0. 
(3.18) 
Then 
G, (z) = bo + Y Y (myz+m,)* 
m,=a,modN,m,#0 m,=a, modN 
-k 
=b+N* Y > (mete #2 4 n) 
m,=a,modN,m,>0 neZ 


a N~*(—1)* sy y (met a, +n) 


m,=—a,modN,m,>0 neZ 


Co 
= bo + a( y +3 pe beth taniN) 


m,=a,modN j=1 
m,>0 


+(—1)* Y yi ¢k-1 e2Rii((m z— vy 


m,=—a,modN 
m,>0 


In these computations we had to split up the sum into two parts, with m, 
replaced by —m, for m, negative, because in applying (3.17) with z replaced 
by (m,z + a,)/N we need (m,z + a,)/NEH, i.e., m, > 0. Now let 


Exer%, gy = 02H, (3.19) 
Then 
Gi(z) = b5 + «( y ny jh gitagyem 
a oe (3.20) 
Ean Ys pte at). 
meee j=l 


To find the coefficient b, of qi, it remains to gather together terms with 
jm, =n. We shall only do this in the cases when a, = 0 and a, = 0. Asa 
result, we have the following proposition. 


Proposition 22. Let c,, 50 ,,, , dy be as in (3.18)—(3.19). For k > 3 let Game (zy 
be the Eisenstein series (3.12). Then the qy-expansion of Ge met 


Ge (2) = Bo, a+ x mn kIN (3.21) 
can be computed from (3.20). If a = (a,, 0), then forn > 1 


oa Y 4p YD ae (3.22) 


j|n ln 
nij=a,modN nfj=—a,modN 
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If a = (0, a,), then forn > 1 
bie =O if Nyns ine = ced HEM + (— KEM). (3.23) 
i|n 
Thus, for a = (0, a3) 


G°:42(z) aes bO,e2) + Cy oy (= P ieee (tes Be (— ye") q", q= e2niz, 
n=1 j|n 
‘ (3.24) 
Proposition 23. Jf 2a = (0, 0) mod N and k is odd, then G"°*™ = 0. Other- 
wise, G{°r2) is nonzero at oo, and G{"” has a zero of order min(a,, N — 4,), 
where we are taking a, in the range 0 <a, <N. That is, for a = (ay, 0) the 
first power of qy which occurs in (3.21) with nonzero coefficient is qx} or ay oS, 


Proor. The first assertion we already saw as a result of (3.15). We now check 
that (3.14) is nonzero (unless N |2a, and k is odd). If k is even, then we have 
a sum of positive terms. If k is odd and we take 0 < a, < N, then the sum 
in (3.14) is equal to 


> ( 1 1 >0 ifa, < N/2; 

Km (a, + nN) (N-—a,+nN)J‘} | <0 ifa, > N/2. 

Finally, we look for the first possible value of in (3.22) for which either 

sum in (3.22) is nonzero. That value is m = min(a,, N — a,), where we have 

the possible value j = | in one of the two sums. Thus, ae = +c, forn= 

min(a,, N — a,). This completes the proof. Oo 
As an application, we show that a certain product of Go can be 

expressed in terms of the 4-function. We shall use this result in the next 

section, where we give Hecke’s proof of the transformation formula for ©(z). 
Recall the Weierstrass y-function and its derivatives from Chapter I: 


1 : 1 1 
P(E; 01,2.) =a > ( ) 


m,neZ (z + ma, + nw )* (mo, + N@>)? 


'(Z3@,,@,)=—-2 ¥ (z+mo,+nw,)?; 


mneZ 


"2; @,,@,) =(-1IkK-D! Y @+mo,+nw)"*, k23. 
m,neZ 


amo 


If a # (0, 0), we can express G, 


e 
gamed are =N* > (az f@ 4omz+ n) 
minez (3.25) 


_ (—1)}' (k-2) {412 + 42 . 
— we —-)!? meee es 


This is the value of g*~” for the lattice L, = {mz +n} at a point of order 


*"(z) in terms of o*~2) as follows: 
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N modulo the lattice, namely (a,z + a,)/N. Thus, the Eisenstein series for 
I(N) are related to the values of the derivatives of g at division points. 

Now suppose that & = 3. By (3.25), G3(z) can vanish only if @’((a,z + 
a,)/N; z, 1) = 0. But 9’ vanishes only at half-lattice points (see the end of 
§1.4). If 2a = (0, 0) mod N, i.e., if 2(a,z + a3)/N is a point in L,, then G3 is 
the zero function, by Proposition 23. Otherwise, ”9’((a,z + a2)/N; z, 1) is 
nonzero. We have proved 


amod N 


Proposition 24. [f 2a = 0 mod N, then G3 (z) #0 for zeH. 


Let p be an odd prime. We now define 


p-1 
h(z) = [] G§0-42™°8?(z). (3.26) 
a,=1 
Proposition 25. h(z)¢M3,,-,)(T'o(p)), its only zero is a (p* — 1)/4 — fold 
zero at 0, and it is a constant multiple of (n?(z)/n(pz))°. 


ProoF. The first part is the special case of (3.16) when N = p, k = 3. h(z) 
is nonzero on H by Proposition 24, and at infinity by Proposition 23. To 
find its order of zero at 0, we examine the g,-expansion of 


pl p-1 
BILS Isie-i = I] Clear aaa oa = I] G{¢2"9) mod p 
a= a,=1 


by (3.13). According to Proposition 23, the first power of g, which appears 
in G{2 med? js gmin(a2.P~42) Thus, the order of zero of A(z) at 0 is 
p-1 (p—1y/2 


Y min(@a,,p—a,)=2 Yo a,=(p?— 1/4. 


a,=1 a,=1 
Now set h(z) = (n?(z)/n(pz))°. Then h(z)* = A(z)?/A(pz) is holomorphic 
and nonzero on H, since A(z) is holomorphic and nonzero on H. Because 
A(z)ES;2(T) < S,,(Mo(p)) and A(pz)€S,2(To(p)) by Proposition 17(a), it 
follows that h(z)* is a modular function of weight 12(p — 1) for T(p). Its 
q-expansion at infinity is 
a’ 1d — 9")*"/q? Td — 9)** = TI (d — 9"P/ — 9"); 
hence h(z)* is holomorphic and nonzero at oo. At the cusp 0 we have 
h(2)*|[S]1 apa) = 21” PA(— 1/2)P/A(—pi2) 
= 27120P-1)212PA(2)P/((z/p)'2A(z/p)) 
p'?A(z)?/A(z/p) = p'?q? Td — 9")?4"/4, TA — 9%)*4, 
which has leading term p'?q?"~. Thus, both h* and h* are elements of 
M, xp-1)T o(p)) with no zero except for a (p? — 1)-order zero at 0. Hence 


their ratio is a constant by Proposition 18. But (h/h)* = const implies that 
h/h = const. This concludes the proof of Proposition 25. Oo 
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Notice that, by (3.15), we have some duplication in the definition (3.26) 
of h(z). That is, if we define 


(p71)/2 
f= JT] Gi™*?@), (3.27) 
a,=1 
we find, by (3.15), that 
A(z) = (—1)'?- PP F(@)?. (3.28) 


Proposition 25 then tells us that the square of the ratio of f(z) to (n?(z)/n( pz))> 
is a constant. Hence, the ratio of those two functions must itself be a con- 
stant, and we have proved 


Proposition 26. The function f(z) defined in (3.27) is a constant multiple of 
(n?(z)/n(pz))?. 


Because each of the G{°-”’ in (3.27) is in M3(I',(p)) by Proposition 21, it 
follows that fe M3)—1)2(T'(p)). However, unlike A(z), f(z) is not, strictly 
speaking, a modular form for the larger group Io(p). 


Proposition 27. Let f(z) be defined by (3.27), and let y = (¢ ye y(p). Then 
f\Lydsp-pe = OF where () is the Legendre symbol (which is +1, depending 
on whether or not d is a square modulo p). 


Proor. Since (0, a,)(¢ %) = (0, daz) mod p, it follows by (3.13) that 
(p12 
f |Drdsw-ny2 = I] GP teeny, 

But by (3.15), the terms in this product are a rearrangement of (3.27), except 
that a minus sign is introduced every time the least positive residue of da, 
modulo p falls in the range (p + 1)/2, (p + 3)/2, ..., p — 1. Let n, be the 
number of times this occurs. Thus, f|[y]3¢p-1y2 = (—D"4f, According to 
Gauss’s lemma, which is an easily proved fact from elementary number 
theory (see, for example, p. 74 of [Hardy and Wright 1960]), we have (—1)"4 
= ($). This proves the proposition. Oo 


The transformation formula in Proposition 27 is an example of a general 
relationship between modular forms for I',(N) and “‘twisted-modular” 
forms for 'y(N), called “modular forms with character’. We now discuss 
this relationship. We start with some very general observations. 

Suppose that I’” is a subgroup of I’, and f(z) is a modular form of weight 
k for the smaller group ’” but not necessarily for the bigger one. Then for 
y eI’ we can at least say that f|[)~*], only depends on the coset of y modulo 
I”. That is, if py’ eT”, then f|[Qy’) *h =S|Ly*e- 

Now suppose that the subgroup I” is normal in I’. To every coset yI” 
associate the linear map f+>f|[y~'], which takes an element in M,(I’”) to 
M,(0”) by Proposition 17 (with I” and y~' in place of I’ and «; note that 
yP’y tA =Y’, since yeI’ and I” is normal in I’). This gives us a group 
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homomorphism p from T’/T” to the linear automorphisms of the vector 
space M,(I”), because for y,, y,ET’ 


P12) SOS |LOry) hk = Abr ddl Dr hk = 01) (oa) f). 


In other words, we have what is called a “representation” of the group 
I’/I’” in the vector space M,(I°”). 

If x: T/T” > C* is a character of the quotient group, then define M4,(T’, x) 
to be the subspace of M,(I’”) consisting of modular forms on which the 
representation p acts by scalar multiplication by y, i.e., 


MT’, 0) ei SEMO OO) S= x0) f for all ye T’}. 


If [’/l’” happens to be abelian, then according to a basic fact about repre- 
sentations of finite abelian groups, M,([”) decomposes into a direct sum 
of M,(T’, x) over the various characters y of T’’/I’”. We shall soon recall the 
simple proof of this fact in the special case that will interest us. 

We apply these observations to the case [” = T',(N), T’ = T(N). Since 
I’, (V) is the kernel of the surjective homomorphism from I')(NV) to (Z/N Z)* 
that takes (% 4) to d, it follows that ',(N) is a normal subgroup of I'y(N) 
with abelian quotient group isomorphic to (Z/N Z)* (see Problems 1-2 of 
§HI.1). Let y be any Dirichlet character modulo N, i.e., any character of 
(Z/NZ)*. In this context the subspace M,(I9(N), x) < M,(T,(V)) is usually 
abbreviated M,(N, x). That is, 


a b 
MAN, 0) & {ye MT (N))|f |v = x@)f for y = 6 ) era} 
(3.29) 
In particular, if y = 1 is the trivial character, then M,(N, 1) = M,(T9(N)). 


Proposition 28. M,(T ,(V)) = @M,(N, x), where the sum is over all Dirichlet 
characters modulo N. 


Proor. As mentioned before, this proposition is actually a special case 
of the basic fact from representation theory that any representation of a 
finite abelian group decomposes into a direct sum of characters. However, 
we shall give an explicit proof anyway. 

First, any function f that satisfies f |[y], = x(d)ffor two distinct characters 
x must clearly be zero; hence, it suffices to show that any fe M,(I',(NV)) can 
be written as a sum of functions f,€ M,(N, x). Let 


eet 7 
h= IN) ame 14)f |Lvals 


where y, is any element of I)(V) with lower-right entry congruent to d 
mod N. We check that for y= (¢@ 2)eT)(N) 


1 
= —— y d ’ 
cAlbap b(N) sedarap YW \Dvaa i 


which, if we replace dd’ by d as the variable of summation, is easily seen 
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to equal y(d’)f,, i.e., f,€M,(N, x). Finally, we sum the f, over all characters 
xy modulo N, reverse the order of summation over d and x, and obtain: 


1 
Pl saan 2 K(4) S| LYadi 
DI a BN) i 
which is equal to f, because the inner sum is | if d= 1 and 0 otherwise. 
Thus, f can be written as a sum of functions in M,(N, 7), as claimed. Oo 


Notice that M,(N, y) = 0 if y has a different parity from k, i.e., ify(—1) 4 
(—1)*. This follows by taking y = —/ in the definition (3.29) and recalling 
that f\[—1]}, = (Df. 

For example, as an immediate corollary of Proposition 28 and the pre- 
ceding remark, we have 


Proposition 29. 
M,(4, 1), k even; 


MT, (4)) = ee gy ead: 


where 1 denotes the trivial character and ~ the unique nontrivial character 
modulo 4. 


Notice that the relationship in (3.29) is multiplicative in y; that is, if it 
holds for y, and y,, then it holds for their product. Thus, as in the case of 
modular forms without character, to show that f(z) is in M,(N, x) it suffices 
to check the transformation rule on a set of elements that generate I')(N). 

As another example, we look at ©7(z) =(Z,ezq")*, whose n-th q- 
expansion coefficient is the number of ways 7 can be written as a sum of two 
squares. 


Proposition 30. ©? ¢ M, (I, (4)) = M,(4, x), where x(a) = (— 1)”. 


Proor. It suffices to verify the transformation rule for —J, T, and S rs= 
(-1 _°), which generate I')(4) (see Problem 13 of §HI.1). This is immediate 
for T, since ©? has period 1. Next, the relation f|[—/], = —f=x(-Df 
holds for any /, by definition. So it remains to treat the case ST*S. Let 


0 —-i 7 1 
ow aan i so that Oy! = 55 Ons 


and an(¢ ) ty = ( 4 =) 
c d — Nb a 
We write ST*S = —a,Taz! = 40,704, and use the relationship ©7|[a,], = 
—i@? (see (3.5)) to obtain 
©7|[ST*S], = ©? |[o47e4], = —1©7?|[Ta,], = —107|[o,], = —O?. 


(Recall that the scalar matrix 4/ acts trivially on all functions, i.e., [1/4], = 
identity.) 


(3.30) 
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To finish the proof of the proposition, we must show the cusp condition, 
i.e., that ©7|[yo], is finite at infinity for all y>¢I’. But the square of ©7|[yo], 
is ©*|[yo]>, and it will be shown in Problem 11 below that ©* e M,(I')(4)); in 
particular, this means that ©*|[y)],, and hence also ©7|[y0],, is finite at 
infinity. This completes the proof. oO 


The spaces M,(N, x) include many of the most important examples of 
modular forms, and will be our basic object of study in several of the sections 
that follow. We also introduce the notation S,(N, x) to denote the subspace 
of cusp forms: S,(N, 4) = M,(N, x) 0 S,(1,(N)). 


def 

The Mellin transform of a modular form. Suppose that f(z) = La,g%, (where 
dw = 7") is a modular form of weight k for a congruence subgroup I’ 
of level N. Further suppose that |a,| = O(n‘) for some constant ce R, i.e., 
that a,/n° is bounded as n > oo. It is not hard to see that the gy-expansion 
coefficients for the Eisenstein series Go"°°” have this property with c = 
k —1 + for any ¢ > 0. For example, in the case I’ = I, the coefficients 
are a constant multiple of o,_,(m), and it is not hard to show that o,_,(n)/ 
n*~!** 50 as n— oo. We shall later show that, if fis a cusp form, we can 
take c= k/2 + ¢. It has been shown (as a consequence of Deligne’s proof 
of the Weil conjectures) that one can actually do better, and take c= 
(A —1)/2 +6. 

In Chapter II we saw that the Mellin transform of @(t) = Ze~-™” and 
certain generalizations are useful in investigating some important Dirichlet 
series, such as the Riemann zeta-function, Dirichlet Z-functions, and the 
Hasse—Weil L-function of the elliptic curves E,: y? = x? — n?x. We now 
look at the Mellin transform for modular forms. 

Because we use a variable z in the upper half-plane rather than ¢ (e.g., 
t = —2iz), we define the Mellin transform by integrating along the positive 
imaginary axis rather than the positive real axis. 

The most important case is 7’ = I’, (N). For now we shall also assume that 
J(#%) =0. Thus, let f(z) = 27, a,q"e¢M,(T,(N)). (Recall that since Te 
I’, (NV), we have an expansion in powers of g = e?”” rather than qy.) We set 


az |” Sf(z)z*" dz. (3.31) 
0 


We now show that if /(z) = £2, a,q" with |a,| = O(n‘), then the integral 
g(s) defined in (3.31) converges for Res >c+1: 


ico co) ico . dz 
Sf(z)z51dz = a, ze2ning 2° 
c¢) 0 


n=1 
re ~ 2. dt: s (°° % : _,dt _ . 
7 2, an( tz) [ rte (where t = —2zinz) 
— (—2zi)‘T(s) ‘ a,n * (see (4.6) of Ch. Il) 
n=1 


(3.32) 
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(where the use of I in the gamma-function I'(s) has no relation to its use 
in the notation for congruence subgroups; but in practice the use of the same 
letter [ should not cause any confusion). Since |a,n~*| = O(n“ **°), this 
last sum is absolutely convergent (and the interchanging of the order of 
integration and summation was justified). 

If f(z) = 2%, a,q"EM,(0,(N)) has ay #0, we replace f(z) by f(z) — ao 
in (3.31). In either case, we then obtain g(s) = (—2z/) ‘T'(s)L,(s), where 


L()= Yan for Res>c+1, if f@= yaa" 
i a (3.33) 
with |a,| = O(n‘). 


In addition to their invariance under [y], for yeI',(N), many modular 
forms also transform nicely under [oy],, where %y =(x ©) as in (3.30). 
It will be shown in the exercises that, for example, any function in M,(N, x) 
for x a real character (i.e., its values are +1) can be written as a sum of 
two functions satisfying 


f\lovk= Cif C=lor-l, (3.34) 


where one of the functions satisfies (3.34) with C = 1 and the other with 
C = —1. An example we already know of a function satisfying (3.34) is 
©2: the relation (3.5) is a special case of (3.34) withk = 1,C=1,N=4. 

We now show that if (3.34) holds, then we have a functional equation 
for the corresponding Mellin transform g(s) which relates g(s) to g(k — 5). 
For simplicity, we shall again suppose that f(z) = Za,g" with ay = 0. We 
can write (3.34) explicitly as-follows, by the definition of [ay ],: 


f(—1Nz) = CN~¥2(—iNz)f(2). (3.35) 


In (3.31), we break up the integral into the part from 0 to i], JN and the 
part from i/,/N to ico. We choose i/,/N because it is the fixed point in H 
of ay: z+» —1/Nz. We have 


g(s) = | © foe S - | * f-UN2)(— 12 UND) 
ij/N 7 iN —1/Nz 
= | (f(z)z° + f= 1/N2)(—1N2)) 2 
if /N 


« | (fleyz! + HON-*f(2(—1)NyE, 
WN e 
because of (3.35). 

In the first place, this integral converges to an entire function of s, because 
f(z) decreases exponentially as z > ioo. That is, because the lower limit of 
integration has been moved away from zero, we no longer have to worry 
about the behavior of the integrand near 0. (Compare with the proof of 
Proposition 13 in Chapter II, where we used a similar technique to find a 
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rapidly convergent series for the critical value of the Hasse—Weil L-function ; 
see the remark following equation (6.7) in §II.6.) 

Moreover, if we replace s by k —s in the last integral and factor out 
ikCN*?(—N)%, we obtain: 
g(k-—s)= ren-s(—Ny"| (i-*CN*?(— N)Sf(z)z*5 4/22 
i,/N 


= i*CN-*?(—N)° | 7 (i*CN~*?f(z)(—1/Nz)* + fz e 
iN . 


= i*CN-*?(—N)°g(s), 


because the last integral is the same as our earlier integral for g(s). This 
equality can be written in the form 


(—i/N)'q(s) = C(-i NY gk — 5). 
Thus, by (3.32)—(3.33), if we define A(s) for Re s > c + 1 by 


A(s) = (~iJN)'9(s) = G/N/20)T (9) Ls), (3.36) 


we have shown that A(s) extends to an entire function of s, and satisfies 
the functional equation 


A(s) = CA(k — 5). (3.37) 


As an example of this result, we can take f(z) = A(z)eS,2(F), which 
satisfies (3.35) with N = 1, k = 12, C= 1. Then A(z) = Ep, t(n)q", Lals) = 
m2, t(n)n-s, and A(s) = (2x) I (s)La(s) satisfies the relation: A(s) = 
A(i2 — 5). 

The derivation of (3.37) from (3.34) indicates a close connection between 
Dirichlet series with a functional equation and modular forms. We came 
across Dirichlet series with a functional equation in a very different context 
in Chapter I]. Namely, the Hasse—Weil L-function of the elliptic curve 
E,: y? =x? —n?x satisfies (3.36)—-(3.37) with k = 2, N = 32n’ for n odd 
and 16n? for n even, C = (2) for n odd and (3) for n even (see (5.10)— 
(5.12) in Ch. II). We also saw that the Hasse-Weil L-function of the elliptic 
curve y? = x° + 16 satisfies (3.36)—(3.37) with k = 2, N= 27, C= 1 (see 
Problem 8(d) of §II.5). 

So the question naturally arises: Can one go the other way? Does every 
Dirichlet series with the right type of functional equation come from some 
modular form, i.e., is it of the form L,(s) for some modular form f? In 
particular, can the Hasse—Weil L-functions we studied in Chapter II be 
obtained by taking the Mellin transform of a suitable modular form of 
weight 2? That is, if we write L(E,, s) in the form LR_, b,,m~* (see (5.3) in 
Ch. ID, is £2_, 5,,q™ the g-expansion of a weight two modular form? 

Hecke [1936] and Weil [1967] showed that the answer to these questions 
is basically yes, but with some qualifications. We shall not give the details, 
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which are available in [Ogg 1969], but shall only outline the situation and 
state Weil’s fundamental theorem on the subject. 

Suppose that L(s) = La,n™* satisfies (3.36)—-(3.37) (and a suitable hy- 
pothesis about convergence). Using the “‘inverse Mellin transform”, one 
can reverse the steps that led to (3.36)—(3.37), and find that f(z) = La,q" 
satisfies (3.34). For now, let us suppose that N = 4? is a perfect square, 
and that C = i* (k even). Then, if f(z) satisfies (3.35), it follows that f, (z) = 
f(z/d) = Xa,q7;, satisfies: f,(—1/z) = z*f,(z). Thus, f, is invariant under [S'], 
and [7’],, and hence is invariant under the group generated by S and 7’. 
Hecke denoted that group G(A). We have encountered the group (2) before. 

In this way one can show, for example, that L(E,,,,, 5) corresponds to a 
modular form (actually, a cusp form) of weight 2 for 6(8n,). 

Unfortunately, however, Hecke’s groups G(A) turn out not to be large 
enough to work with satisfactorily. In general, they are not congruence 
subgroups. (6(2) > I'(2) is an exception.) 

But one can do much better. Weil showed, roughly speaking, that if one 
has functional equations analogous to (3.36)—(3.37) for enough “‘twists”’ 
Xx(n)a,n * of the Dirichlet series La,n~*, then the corresponding q- 
expansion is in M,(I')(4V)). We now give a more precise statement of Weil’s 
theorem. 

Let 7 be-a fixed Dirichlet character modulo N (7 is allowed to be the 
trivial character). Let y be a variable Dirichlet character of conductor m, 
where m is either an odd prime not dividing N, or else 4 (we allow m = 4 
only if NV is odd). By a “large” set of values of m we shall mean that the set 
contains at least one m in any given arithmetic progression {u + jv}j.z, 
where u and v are relatively prime. According to Dirichlet’s theorem, any 
such arithmetic progression contains a prime; thus, a “large” set of primes 
is one which satisfies (this weak form of) Dirichlet’s theorem. By a “‘large”’ 
set of characters y we shall mean the set of all nontrivial y modulo m for a 
“large” set of m. 

Let C= +1, and for any x of conductor m set 


C, = Croll x(-N)g Cola, (3.38) 
where g(x) = 27, x(/)e?™” is the Gauss sum. Given a q-expansion f(z) = 
0 ang", g =e?™*, for which |a,| = O(n‘), we define L,(s) by (3.33) and 
A(s) by (3.36), and we further define 


LG 5)= LY xan; AG, 8) = (m,/N/2n)T()L 5). (3.39) 
n=1 
Weil’s Theorem. Suppose that f(z) = =% 94,9", q = e°""*, has the property 


that |a,| = O(n‘), ceR. Suppose that for C=1 or —1 the function A(s) 
defined by (3.36) has the property that A(s) + do(1/s + C/(k — s)) extends 
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to an entire function which is bounded in any vertical strip of the complex 
plane, and satisfies the functional equation A(s) = CA(k — s). Further suppose 
that for a “‘large’’ set of characters y of conductor m (in the sense explained 
above), the function A(y, s) defined by (3.39) extends to an entire function 
which is bounded in any vertical strip, and satisfies the functional equation 
AG 5) = C,AQ, k — 5), with C, defined in (3.38). 

Then fe M,(N, Xo), and f satisfies (3.34). If, in addition, L s(S) Converges 
absolutely for Re s > k — & for some ¢ > 0, then f is a cusp form. 


One can show that the Hasse—Weil L-functions of the elliptic curves in 
Chapter II satisfy the hypotheses of Weil’s theorem (with y, = 1). The same 
techniques as in the proof of the theorem in §II.5 can be used to show this. 
However, one must consider the Hecke L-series obtained in (5.6) of Ch. II 
by replacing 7,(/) by the character 7,(/) (NJ) with y any Dirichlet character 
modulo m as in Weil’s theorem. For example, if we do this for L(E,, s), 
where £, is the elliptic curve y? = x3 — x, wecan conclude by Weil’s theorem 
that 


Se, (2) = 4 — 2¢° — 3g? + 6q'3 + 2g'7 + Y b,qQ™ (3.40) 
m>25 
(see (5.4) of Ch. II) is a cusp form of weight two for I'(32). 

If we form the g-expansion corresponding to the L-series of E,: y? = 
x3 — n?x, namely, fe, (2) = X7,(m)bnq™, it turns out that fp, € M2 (I'9(32n’)) 
for n odd and Se, € Mz(T(16n7)) for n even. Note that when n = 1 mod 4, 
so that y,, is a character of conductor n, this is an immediate consequence of 
the fact that f,, € M,(I'9(32)), by Proposition 17(b). 

More generally, it can be shown that the Hasse—Weil Z-function for any 
elliptic curve with complex multiplication satisfies the hypotheses of Weil’s 
theorem with k = 2, and so corresponds to a weight two modular form 
(actually, a cusp form) for T'9(4). (N is the so-called “conductor” of the 
elliptic curve.) 

Many elliptic curves without complex multiplication are also known to 
have this property. In fact, it was conjectured (by Taniyama and Weil) that 
every elliptic curve defined over the rational numbers has L-function which 
satisfies Weil’s theorem for some N. Geometrically, the cusp forms of 
weight two can be regarded as holomorphic differential forms on the 
Riemann surface I',(N)\H (i.e., the fundamental domain with T)(4)- 
equivalent boundary sides identified and the cusps included). The Taniyama— 
Weil conjecture then can be shown to take the form: every elliptic curve 
over Q can be obtained as a quotient of the Jacobian of some such Riemann 
surface. 

For more information about the correspondence between modular forms 
and Dirichlet series, see [Hecke 1981], [Weil 1967], [Ogg 1969], and 
[Shimura 1971]. 
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Finally, we mention a dramatic and surprising result of G. Frey, J.-P. Serre, 
and K. Ribet: The Taniyama—Weil conjecture implies Fermat’s Last Theorem. 
For an account, see [Oesterlé 1988]. 


PROBLEMS 


1. Let a€GL3(Q), and let g(z) = f(az). Let y = (¢ 2)eT. Notice that f(«z)|[y], was 
defined to be (caz + d) *f(yaz), which is not the same as g(z)|[y], = (cz + d)™* 
S(«yz). Show that if « =(§ 9), ie., if az =nz, then g(z)|[y], = f(az)|[Lopo], = 
f(nz)|UGu 2) dee 


2. Let I’ be a congruence subgroup of I of level N, and denote Ij = {yeI’|ys = 5} 
for seQu {co}. Lets = a1 00, wel. 
(a) Prove that ofa7! = (aI’«"'),. 
(b) Show that there exists a unique positive integer A (called the “ramification 
index” of I’ at s) such that 


(i) in the case —/eT’ 


l= +07 {7} 12%; 


(ii) in the case —/¢T’ either 
Ti =a {7 7a; or (IIa) 


Ty =a ((-T' next. (Ib) 


Show that / is a divisor of N. 

(c) Show that the integer A and the type (I, Ila, or IIb) of s does not depend on the 
choice of ae with s = a7!00; and they only depend on the I’’-equivalence 
class of s. 

(d) Show that if «~! oo is of type I or Ila and fe M,(T’), then f|[«7"], has a Fourier 
expansion in powers of q,. A cusp of I’ is called “regular” if it is of type I or 
Ila; it is called “irregular” if it is of type IIb. 

(ce) Show that if «~!oo is an irregular cusp, and feM,(I’), then f|[a~"], has a 
Fourier expansion in powers of q., in which only odd powers appear if k is odd 
and only even powers appear if & is even. If k is odd, note that this means that 
to show that fe M,(I’’) is a cusp form one need only check the g-expansions at 
the regular cusps. 


3. Let A be any positive integer, and suppose 2h|N, N > 4. Let I’ be the following 
level N congruence subgroup: I’ = {(@ 5)=(~3 =*)) mod N for some j}. Show 
that 00 is a cusp of type IIb. 


4. (a) Show that I’, (V) has the same cusps as I')(V) for N = 3, 4. 
(b) Note that —/¢1I°,(N) for N > 2. Which of the cusps of T, (3) and I, (4), if any, 
are irregular? 


5. Find the ramification indices of I’ at all of its cusps when: 
(a) T° =To(p) (pa prime); 
(b) 1’ = To(p*); 
(c) Fr’ =F). 
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Il. 


12. 


. Prove that if T’ < T is a normal subgroup, then all cusps have the same ramifica- 


tion index, namely [T,,: +I]. 


. (a) Show that any weight zero modular function for I’ < I satisfies a polynomial 


of degree [T': I’’] over the field C(/) of weight zero modular functions for I. 

(b) Show that, if I’ is a normal subgroup, and if f(z) is T-invariant, then so is 
f(az) for any wel. Then show that the field of weight zero modular functions 
for I’ is a galois field extension of €(j) whose galois group is a quotient of 
I'/I’. In practice (e.g., if I’ is a congruence subgroup), it can be shown that the 
galois group is equal to T/T’. 


. Prove the following identities, which will be useful in the problems that follow and 


in the next section, by manipulation of power series and products in g = e?*?: 
(a) O(z) + O(z + 4) = 20(4z); 
(b) E,(z + 3) — Ey (Zz) = 48 Loaans0 01)"; 
(c) E,(z)- (+ pX)E,(pz) + DY E,(p?z) = 5 Daly O,-1(")q" 
(k => 2, p prime); 
(d) E,(z) — 3£,(2z) + 2E,(4z) = 3(E,(z) — E,(z + 4); 
(e) n(z + 5) = e7™4899(2z)/n(z)n(4z). 


. Prove that if k is even and f(z) has period one and satisfies f(—1/4z) = (—4z?)¥? 


S(z), then f|[y], =f for all yeT,(4). 


. (a) Prove that 4(4z)/n*(2z) € M,(I'o(4)), and find its value at each cusp. 


(b) For aéZ prove that E,(ST~*Sz) = (az + 1)? E,(z) — ®#(az + 1). 

(c) Let F(z) = —34(E,(z) — 3E,(2z) + 2E,(4z)) = Logans0 0 (")Q" by Problem 
8(c). Prove that F(z)€M,(I(4)), and find its value at each cusp. 

(d) Prove that F(z) = y8(4z)/n*(2z). Then derive the identity 


q{I]a-@*yd+a = YY a(n)". 
n=1 odd n>0O 
(e) Give a different proof that —24F(z) = $(£,(z) — E,(z + 1/2)) is in M(T(4)) 
by proving that, more generally, E,(z) — y Uo E,(z + j/N) is in M,(T(N2)). 


(a) Prove that @(z)*e M,(I'o(4)), and find its value at each cusp. 

(b) Show that @(z)* and F(z) (see preceding problem) are linearly independent. 
(c) Prove that 4?°(2z)/n*(z)y(4z) € M,(I'o(4)), and find its value at each cusp. 
(d) Prove that O(z) = 4°(2z)/n?(z)y?(4z). 

(e) Prove that O(z) =e ?"?4y?(z + 4)/n(2z). 


Let N = 7 or 23, and let k = 24/(N + 1). Let x be the Legendre symbol y(n) = (£). 
Prove that any nonzero element of S,(N, x) must be a constant multiple of 
(n(z)n(Nz))*. 


. Using Propositions 25-27, prove that (y(z)9(3z))° €S¢(M(3)) and (n(z)n(7z))2€ 


S3(7, x) where y(n) = (§). 


. Let (2) = 2,707" = O(z/2). Let y be the unique nontrivial character of G(2)/ 


1(2) (which has 2 elements). Show that ¢*¢ M,(G(2), x). 


. Let fe M,(N, x), and set ay = (2 ~}). 


(a) Prove that f|[ay],¢M,(N, x), and that the map f+ |[oy ], is an isomorphism 
of vector spaces from M,(N, x) to M,(N, 7). Prove that the square of this map 
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(i.e., where one uses it to go from M,(N, x) to M,(N, z) and then again to go 
from M,(N, 7%) back to M,(N, x)) is the map (—1)* on M,(N, y). 

(b) If x=, ie., if x takes only the values +1, then prove that M,(N, xy) = 
M,N, ~)®M, (Ny), where Me (N, ) = {feM(N, Df lLavk = £74}. 
In other words, any modular form in M,(N, x) can be written as a sum of one 
which is fixed under i*[ ay], and one which is taken to its negative by i*[ay ],. 

(c) Let N = 4. In Problem 17(d) below, we shall see that ©* and Fspan M,(T(4)) = 
M,(4, 1) (where F = 2,440, (")q” as in Problem 10, and 1 denotes the trivial 
character in M,(4, 1)). Assuming this, find the matrix of [«,], in the basis ©4, 
F; show that M,°(4, 1) and M; (4, 1) are each one-dimensional; and find a 
basis for M,(T'9(4)),consisting of eigenforms for [#4]. If you normalize these 
eigenvectors by requiring the coefficient of g in their g-expansions to be 1, then 
they are uniquely determined. 


. (a) For k => 2 even, let f(z) = Ske = age +2, o,-,(a)q". Express L,(s) (see 


the definition in (3.33)) in terms of the Riemann zeta-function. 

(b) Write an Euler product for L,(s). 

(c) Let f(z) = D2, o-1(m)x(n)q" for a Dirichlet character y. Write an Euler 
product for Ly). 


Let F(2) be the fundamental domain for I'(2) constructed in §1 (see Fig. 111.3). Then 

F’ = aF(2), where « = (2 ~3), is a fundamental domain for I')(4) = oI (2)a7! (see 

Problem 10 in §III.1). The boundary of F’ consists of: two vertical lines extending 

from (—3 + i,/3)/4 and from (1 + i./3)/4 to infinity; two arcs of circles of radius 

4, one centered at —4 and one centered at 0; the arc of the circle of radius ¢ and 

center § which extends from 0 to (9 + i,/3)/28; and the arc of the circle of radius 

qo and center 7 which extends from (9 + i,/3)/28 to 4. Consider T'y(4)-equivalent 

points on the boundary of F’ to be identified. 

(a) Find all elliptic points in F’ (i.e., points which are '-equivalent to i or w = 
(-—1+ i,/3)/2). Which are on the boundary and which are in the interior of F’? 

(b) Let f(z) be a nonzero modular function of weight k (k eZ even) for I')(4). Let 
v,(f) denote the order of zero or pole of f(z) at the point P. At a cusp P = 
a~'oo, we define v,(/) to be the first power of g, with nonzero coefficient in 
the Fourier expansion of f|[«~'], (where h is the ramification index; see 
Problem 2 above). Prove that: 2,.,-v,(f) = k/2, where the summation is over 
all points in the fundamental domain F’, including the three cusps, but taking 
only one point in a set of I')(4)-equivalent boundary points (e.g., {—3 + iy, 
bt iy} or (2 +08, 4409, Se +), 

(c) Describe the zeros of @(z)* and F(z) (see Problems 10-11 above). 

(d) Prove that ©* and F span M,(T,(4)). 

(e) Prove that M,(I')(4)) = 0 if k < 0, and it contains only the constants if k = 0. 

(f) Prove that for k = 2k, a nonnegative even integer, any fe M,(T)(4)) can be 
written as a homogeneous polynomial of degree ky in F and ©*. 

(g) Prove that S,(I9(4)) is one-dimensional and is spanned by O° F — 160*F?. 

(h) Prove that n7(2z) dé S.(Io(2)), but that n17(2z) € S;(Io(4)). Then conclude that 
n'7(2z) = OF F— 1604 F?. 

(i) Prove that for k = 2k, = 6, any fe S,(I'9(4)) can be written as a homogeneous 
polynomial of degree ky in F and ©* that is divisible by O* F(©* — 16F). 


(a) If fe M,,(N, 71) and ge M,,(N, 72), show that fge My, 44,(N, 2142): 
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(b) Let y be the unique nontrivial character of (Z/4Z)*. Show that any element of 
M, (4, y) is a constant multiple of ©?. 

(c) With x as in part (b), find a formula for dim S,(T,(4)) = dim S,(4, x"). 

(d) Let f(z) = (n(z)n(2z))8, and let g(z) = f(2z). Show that S, (I, (4)) is spanned by 
fand g. 

19. Let P’ < F be a congruence subgroup, with T = (Ja,I’, so that F’ = Jo; *F isa 
fundamental domain for I’. Let f; = f|[o;*], for fe M,(I’’). Suppose that fe 5,1), 
so that f,(z) = Zy21 ay, idhp where fh; is the ramification index of I’ at the cusp 
5; = a; 00. In particular, f(z) = L721 a,g; at the cusp oo. 

(a) Show that there exists a constant C independent of j and x such that 


[f(x + iy)| < Ce?) for y>e. 


(b) Let g(z) = (Im z)*?| f(z)|. Show that g(yz) = (Im z)*?| f(z)|[y],| for yeT. 
(c) Show that g,(z) = (Im z)*?| f;(z)| is bounded on F. 

(d) Show that g(z) is bounded on F’. 

(e) Show that g(z) is bounded on H. 

(f) Show that for any fixed y: 


h 


0 


h 
dy = Al f(xt+ iy)e 2rina tiny. 


(g) Show that there exists a constant C, such that for all y: 
la,| < Cy "027", 


(h) Choosing y = 1/n in part (g), show that for C, = C,e?™": |a,| < Cyn*?. 
(i) Show that |a,, |J7~"? is similarly bounded for each j. 
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We first define some notation. Let d be an odd integer, and let c be any 
integer. The quadratic residue symbol (4) is defined in the usual way when 
d is a (positive) prime number, i.e., it equals 0 if dc, 1 if c is a nonzero 
quadratic residue modulo d, and —1 otherwise. We extend this definition 
to arbitrary odd d as follows. First, if g.c.d.(c, d) > 1, then always (4) = 0. 
Next, if d is positive, we write d as a product of primes d = IT; p; (not neces- 
sarily distinct), and define (7) = N,Q). If d= +1 and c = 0, we adopt the 
convention that (#7) = 1. Finally, if d is negative, then we define (4) = (ja) 
ifc > O and (4) = —(jq) ife <0. 

It is easy to check that this quadratic residue symbol is bimultiplicative, 
i.e., it is multiplicative in c if d is held fixed and multiplicative in d if c is 
held fixed. It is also periodic with period d when d is positive: (44%) = (9) 
if d > 0. However, one must be careful, because periodicity fails when d is 
negative and c + dand c have different signs: (4%) = —(ife >O>c+d. 
This is because of our convention that (7) = —(Gj) when both c and d are 
negative. On the other hand, this convention ensures that the usual formula 
=) = (—1)*"!? holds whether d is positive or negative. 
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Next, we adopt the convention that ./z for ze C always denotes the branch 
whose argument is in the interval (— 2/2, 2/2]. We next define ¢, for d odd by: 


&a = V()), ie., 


(4.1) 


- 1 ifd=1mod4; 
“a; ifd=3 mod 4. 


With these definitions we finally define the ‘‘automorphy factor” /(y, z), 
which depends on y = (4 °)eI)(4) and zeH: 


a b 
IQs 2) & (S)rve +d for y= (’ i) eTot4) zeH. (4.2) 


Recall our definition of the theta-function @(z) = 2,79" = Zye7 27”. 
The purpose of this section is to prove the following theorem, following 
Hecke [1944]. 


Theorem. For y€1,(4) andzeH 
O(7z) =J0, DOW), (4.3) 
where j(y, Z) is defined by (4.2). 


Notice that the square of /(y, z) is (G@)(cz + d), and so the square of the 
equality is precisely what we proved in Proposition 30. Thus, for fixed 
yéI,(4) the ratio of the two sides of (4.3) is a holomorphic function of 
z€ H whose square is identically 1. Thus, the ratio itself is +1. The content 
of the theorem is that this ratio is +1, i.e., that /(y, z) has the right sign. 

Simple as that sounds, the theorem is by no means trivial to prove. At 
first, it might seem sensible to proceed as in the proof of Proposition 30, 
proving that (4.3) holds for generators of I9(4). However, then we would 
have to show that the expression /(y, z) in (4.2) has a certain multiplicative 
property which ensures that, if (4.3) holds for y, and y,, then it must hold 
for ), 72. But that is a mess to try to show directly. We shall, in fact, conclude 
such a property for j(y, z) as a consequence of the theorem (see Problem 3 
at the end of this section). 

In proving the theorem, it turns out to be easier to work with the function 
1 @(z/2) = X,-7 e™"”*, which we encountered in Problem 14 of SHI.3. 
This function satisfies: @(Tz) = $(z) (obvious from the definition) and 
(Sz) = ./ —izp(z) (immediate from (3.4)). Hence, (yz) has a transforma- 
tion rule for any y in the group (2) generated by +77’, S. It is because 
(2) is such a large group—having only index 3 in T—that it is sometimes 
easier to work with @. The corresponding group under which ©(z) = $(«z), 
o# =(2 °), has a transformation rule is «~1@(2)« (see Problem 1 of §III.3). 
But « 'G(2)a is not contained in Fr = SL,(Z); its intersection with I is the 
subgroup I',(4) of index six in’. Thus, we can work with a “larger” subgroup 
of T (i.e., its index is smaller) if we work with ¢(z) rather than ©(z). The 
next lemma gives an equivalent form of the theorem in terms of #(z). 


Lemma 1. The theorem follows if we prove the following transformation 


formula for (2): 
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(yz) = i? (2). /—i(cz + d)d(z) (4.4) 
for yeT such that y = (4 *) =(_° 3) (mod 2) with d # 0. 


Proor. Suppose that we have the transformation rule for ¢(z). We must 
show that © satisfies (4.3) for all y= (2 5)€I,(4). We first note that if 
c=0, then (4.3) holds trivially, since in that case @(yz) = O(z), while 
JQ, Z) = & id = 1 (since d= +1). So in what follows we suppose that 
c #0. 

For any y = (4 °)eT,(4) with c 4 0 we write 


_ az+b\ _ ,f 2b(—1/2z)—a\ _ sli 
O02) = 4 (o% + 2) = o( Aa =) = $('(-1/22)), 


where y’ = (77 —) and y’ =(_? 4) (mod 2), because 4|c. We apply (4.4) 
with y’ in place of y and — 1/2z in place of z. Using the fact that (=$2) = (=4)(5), 
we obtain 


O(yz) = i -n( “V(< Jy —i(d(—1/2z) — c/2)¢(—1/2z). 


Next, we have $(— 1/2z) = @(—1/4z) = ./ —2iz@(z) by (3.4). The product 
of the two square root terms is +./cz + d (note that, because of our conven- 
tion on the branch of the square root, we have ,/x,/y = +./xy; for example, 
J/-1,/—1= —,/1). But since the three functions ./—i(d(— 1/2z) — ¢/2), 
—2iz, and ,/cz + d are all holomorphic on H, the + must be the same 
for all z; so it suffices to check for any one value of z, say z = i. But in that 
case ,/—2iz = ,/2, and ,/x,/y = +,/xy always holds when y is positive real. 
Thus, the product of the two square root terms is ./cz + d, and we have 


O(yz) = i4 -1ia(=2 ’) é \ve + dQ@Q(z). 


To complete the proof of Lemma 1, it remains to check that i{1~®”?(=?) = e7', 
which we easily do by considering the cases d = 1, 3, 5, 7 (mod 8). oO 


The remainder of this section is devoted to proving (4.4). 
For a fixed odd prime p, let us denote 


W(z) = n?(z)/n(pz), (4.5) 


where n(z) is the Dedekind eta-function, as in §2 and §3. According to 
Propositions 26-27, we have We My p-12(P, (5)), Le., W? is a modular 
form for I'o(p) with character y(d) = (4). This is the basic tool which will 
be used to prove (4.4). But it will take several lemmas to relate y? and @. 


Lemma 2. 
o(p2)/0"@) = WO? (2). (4.6) 
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Proor. By Problem 11(e) in the preceding section, with z replaced by 3 and 
by 4, we find that the left side of (4.6) is equal to 


e224? (PY /m(pz) _ yup-vyi24)2mi_W)_ Pt) 
(e749? SY /n(2))? wr) w(py) 
But since (z+ 1) = e?7!?4n(z), and p((z + 1)/2)=(pz+ 1/2 +47, it 
follows that the last term on the right is e-‘?-')?*?*. This proves (4.6). © 


Lemma 3. Let p be an odd prime, let y = (2 °)€G(2) NT 9(p), and let f(z) = 
Wr. Then f |[y]p-1y2 = (pf. 


Proor. Let « =(j 3), so that w((z + 1/2) = w(az). Then w((yz + 1)/2) = 
W(ayz) = W((apa')az). Now for y as in the lemma, we have 


b+d—a-—c)/2 
eee ‘ +c (b+d-—a-—c)/ ero. 
2c d—-c 


(Note that b+ d—a-—c is divisible by 2 because y¢G(2).) Hence, by 
Propositions 26-27, we have 


y? (eH) = (S ‘} (2caz + d— cP YP Wy?(az) 


2 
-()c2eapronye(e1) 


because d — c = d (mod p). This is the relation asserted in the lemma. Oo 


Lemma 4. Let p be an odd prime, let y = (2 *)€G(2) NT o(p), and let g(z) = 
(pz)/?(z). Then g|Ly]u-py2 = @)9- 


Proor. We first claim that g® transforms trivially under y. Let «= (8 9), 
and let y’ = wya”!. Then both y and »’ are in G(2), and we can use Problem 14 
of the preceding section to compute 


9°(z)|Ly]aa—p = (ez + d)*?) 68’ az)/?"(yz) 
_ Gaz +d) *o*(y'az) 
(cz + d)*?h8"(yz) 

_ $%(22) 

p*?(z) 


as claimed. Meanwhile, the ninth power of g transforms under y by ($), as 
we see by raising both sides of (4.6) to the 9th power and using Propositions 
26-27 and Lemma 3 (here f(z) is as in Lemma 3): 


W?*|Drdse-n2)* _ (WP _ (f\e. 
(f|0) 3(p-19/2) (()f)° P 


= g*(z), 


9° |Ly oct -py2 = (W/f)|Ly ou -py2 = 
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Taking the quotient of these two relationships gives 


d 
[7 la-py2 = 9°|LyJoa-pya/9* Lr Jaa» a (‘)a 7 


The next lemma generalizes Lemma 4 by replacing the prime p by an 
arbitrary positive odd number n. 


Lemma 5. Let n be a positive odd integer, let y = (2 *>)€ G(2) AT 9(n), and let 
g(Z) = b(nz)/p"(z). Then Ly -ny2 = (1g. 


PROOF. We write n= p,---p, as a product of primes (not necessarily 
distinct), and we use induction on the number r of prime factors. Lemma 4 
is the case r = 1. Now suppose we know Lemma 5 for n; we shall prove the 
corresponding equality for a product n’ = np of r+ 1 primes. We write 


G(n'z) _ p(naz) (b(p2)\ 
o"@) ~ o"az) Gar 0 


where « = (§ $). For y = (¢ 4) €To(n’) 9 G(2) we have y’ = aya = (7, “Pye 
Ty(”) ~ G(2), and so, by the induction assumption, 


1—n)/2 
p(nayz)/p"(ayz) = p(ny’az)/h"(y’az) = (3) (<x: + a) p(naz)/p"(az) 
= (2 )ec +d)??? 6(naz)/b"(az). 


In addition, by Lemma 4, we have 


b(pyz)/P"(9z) = (S\ecz + dy"? b( pz)/p"(z). 


Combining these two relations, we see that replacing z by yz in (4.7) has the 
effect of multiplying by 


(“Jee 4+ dyi-m (Se je apm) - (2)()tc 4 dyin 
= (2) (cz + dyA-"2, 


This completes the induction step, and the proof of the lemma. Oo 


We are now ready to prove (4.4). We first note that both sides of (4.4) 
remain unchanged if y is replaced by —y (see Problem 2 below). Hence, 
without loss of generality we may suppose that y = (4 5) =(_° 3) mod 2 
with c > 0. 

We now apply Lemma 5 with x replaced by the positive odd integer c, 
obtaining: 
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wes, - (2 )tc s jon (4.8) 
The ratio that ultimately interests us is }(yz)/¢(z). Solving for this in (4.8) 
gives 
On the other hand, we have (using: ad — 1 = bc) 
biped i 
cz+d cz+d 


Since a is even and @ has period 2, this means that 


o(cyz) = ob ( 4) = /—-i(cz + d)o(cz + d) 


ez + 


= ./—i(cz + d) (cz). 


Combining (4.9) and (4.10) gives 


(4.10) 


c 


Meanwhile, we saw in Problem 14 of the preceding section that ¢° is invariant 
under [y], for y¢ G(2), i-e., 


We now raise both sides of (4.11) to the c-th power and divide by (4.12), 
where k is chosen so that c? = 8k + 1. (Since ¢ is odd, of course c? = 1 
mod 8.) The result is: 


oO = (<) (cz + dye12-4%( — (ez + dy), [—i(ez + d). 
But c(c — 1)/2 — 4k + (ec — 1)/2 = (c? — 1)/2 — 4k = 0. Hence, 


oe) z (4) jer (era), 


c 


which is the transformation formula (4.4) that we wanted to prove. This 
concludes the proof of the main theorem as well. Oo 


The transformation formula for the theta-function is similar to the trans- 
formation formula for a modular form of weight k if we take k = 3, i.e., 
except for a power of i the ‘“automorphy factor” is (cz + d)*. In the next 
chapter we shall see that there is a general theory of modular forms whose 
weight is a half-integer, and the transformation formula for O(z) plays a 
fundamental role in describing such functions. 
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PROBLEMS 


1. Prove that the generalized quadratic residue symbol (4) as defined in this section 
satisfies the following form of quadratic reciprocity: if c and d are both odd integers, 
then 


“ (-1)* 4-42) if c or dis positive; 
d= —(-1)6-P@-Y4(S) if and d are negative. 
2. Show directly that both sides of (4.4) remain unchanged if y is replaced by — y. 


3. (a) Show directly (using (3.4)) that the theorem (4.3) holds for the generators — J, 
T, and ST~*S of T(4). 
(b) Show that the theorem would follow from part (a) if one could show that 


(ap, z) = j(%, Bz)j(B, z) forall «, BeETo(4). (4.13) 


(c) Conversely, show that the theorem proved in this section implies the relation 
(4.13). 


§5. The modular interpretation, and Hecke operators 


A basic feature of modular forms is their interpretation as functions on 
lattices. More precisely, we consider the most important cases of a congruence 
subgroup I’: T’ = T,1T,(N), o(N) or P(N). (Of course, F = T, (1) = T9(1) 
= I(1), so everything we say about the cases T',(4V), [)(N) or P(N) will 
apply to I if we set N = 1.) By a “modular point” for I’ we mean: 


(i) for Tl’ =T: a lattice LD oC; 
(ii) for T’ = 1, (4): a pair (LZ, 0), where L is a lattice in C, and te C/L is 
a point of exact order N; 

(iii) for [’ =I 9(N): a pair (L, S), where L is a lattice in C, and Sc C/L 
is a cyclic subgroup of order N, i.e., S = Zt for some point teC/L of 
exact order N. 

(iv) for 7’ = TN): a pair (LZ, {7,, t2}), where ¢,, t,€C/L have the property 
that every te xL/L is of the form ¢ = mt, + ntp, ie., t;, t, forma basis 
for the points of order N (in particular, t, and ¢, must each have exact 
order NV). 


Given a lattice L, in general there will be several modular points of the 
form (L, t), (L, S), or (L, {t,, t2}). However, when N = 1, there is only 
one modular point corresponding to each L, and we identify it with the 
modular point L for T. 

Let keZ. In each case (i)—(iv), we consider complex-valued functions 
F on the set of modular points which are of “weight k” in the following 
sense. If we scale a modular point by a nonzero complex number /, then the 
value of F changes by a factor of 4". That is, for 2€C* we consider AL = 
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{al|leL}, AteC/AL, AS = {At|te S} < C/AL. Then F is defined to be of 
weight k if for all Ae C* 


case (i) F(AL) = A-*F(L) for all modular points L; 
case (ii) F(AL, At) = A-“F(L, t) for all modular points (L, f); 
case (iii) FL, AS) = 4-*F(L, S) for all modular points (L, S); 
case (iv) FAL, {At,, At,}) = 2-*F(L, {t,,t}) for all modular points 


(ZL, {fy, t,}). 
An example of such a function of weight k is 
GAs ¥ (kK > 2 even). (5.1) 


O#leL 


Notice that any function F in case (i), such as G,, automatically gives a 
function for the other groups; for example, by setting F(L, t) = F(Z). 
Given a function F of weight k, we define two corresponding functions 
F and f as follows. F(a) is a complex- valued function on column vectors 
= =) such that w,/m,¢H; f(z) is a function on the upper half-plane 
i. Let L,, be the lattice spanned by m, and w,, and let L, be the lattice 
spanned by z and 1. Given F as above, we define 


case (i) F(w) = F(L,); 
case (ii) F(w) = F(L,,, @,/N); 
case (iii) F(w) = F(L,, Zw,/N); 
case (iv) F(w) = F(L,, {@,/N, w,/N}). 


In all cases we define f(z) = F(;). Thus, for example, the function f(z) 
that corresponds to G,(L) (see (5.1)) is the Eisenstein series we denoted 
G,(z) in §2 (see (2.5)). 

For y = (¢ 5)eT = SL,(Z), we define the action of y on functions of w 
by the rule yF(w) = F(ya), where yw is the usual multiplication of a column 
vector by a matrix. 


Proposition 31. Let k eZ, and let T’=T,1,(N), [o(NV) or P(N). The above 
association of F with F and f gives a one-to-one correspondence between the 
following sets of complex-valued functions: (1) F on modular points which 
have weight k; (2) F on column vectors w which are invariant under y for 
ye’ and satisfy F(Aw) = 4-*F(@); (3) f on H which are invariant under 
Ly]. for yer. 


Proor. We shall treat case (ii), and leave the other cases as exercises. 
Suppose yeI',(N) and F is a weight & function of modular points (Z, 2). 
We first compute: 


Fyw) = F oa cor dea) = F(L,, 2/N) = Flo), 


because La. +b0,,c0,+do, = Lo,,o, (Since yeT) and (cw, + dw,)/N = @,/N 
mod L (since yeT, (V)). We also have 
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F(d@) = F(L,,,, 4@2/N) = 2-*F(L,, ©2/N) = 1-* F(a). 


Next, we have 


1 cz+d 
Sz) = F (Beara x) = (cz + d)‘F Carer etd), 


because F has weight &. But the lattice spanned by az + b and cz +d is 
L,, and (cz + d)/N = } mod L, ; hence 


S(yz) = (cz + dkF (x. x) = (cz + d)¥f(z). 


Thus, the F and f corresponding to F have the properties claimed. 

To show the correspondence in the other direction, given f(z) we define 
F(w) to be w3*f(w,/w>); and given F we define F(L, t) to be F(w), where w 
is chosen to be any basis of L such that w,/N =f mod L. One must first 
check that the definition of F makes sense (i.e., that such a basis w exists), 
and that the definition of F is independent of the choice of such a basis w. 
The first point is routine, using the fact that ¢ has exact order N in C/L, and 
the second point follows immediately because any other such basis must be 
of the form yw with yeT, (4). It is also easy to check that, once fis invariant 
under [y], for yel,(N), it follows that F is invariant under y and has weight 
k;and that, if Fhas weight k, then so does the corresponding F. The construc- 
tion going from F to F to f and the construction going from f to F to F are 
clearly inverse to one another. This concludes the proof. o 


We say that F is a modular function/ modular form/ cusp form if the 
corresponding fis a modular function/ modular form/ cusp form as defined 
in §3. 

We now discuss the Hecke operators acting on modular forms of weight k 
for I,(V). We could define them directly on f(z) ¢ M,(T,(4)). However, the 
definition appears more natural when given in terms of the corresponding 
functions F on modular points. 

Let # denote the Q-vector space of formal finite linear combinations of 
modular points, i.e., Y = @Qe,, is the direct sum of infinitely many 
one-dimensional spaces, one for each pair (L, t), where L is any lattice in 
C and teC/L is any point of exact order N. A linear map T: # > Y can be 
given by describing the image Te, ,= £a,ep, of each basis element; here 
{P,} are a finite set of modular points. 

For each positive integer n we define a linear map 7,: # — ¥ by the 
following formula giving the image of the basis vector e, ,: 


Tele) =2Lee-e (5.2) 


where the summation is over all lattices L’ containing L with index n such 
that (L’, ) is a modular point. (Here for te C/L we still use the letter ¢ to 
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denote the image of t modulo the larger lattice L’.) In other words, L’/L < 
C/L is a subgroup of order n, and ¢ must have exact order N modulo the 
larger lattice L’ as well as modulo L. The latter condition means that the 
only multiples Zt which are in L’/L are the multiples ZN¢r which are in L. 
In the case N= 1, ie., I’ =I, this condition disappears, and we sum 
over all lattices L’ with [L’: L] =n. The condition on ¢ is also empty if 
nand N have no common factor. To see this, suppose that g.c.d.(n, NV) = 1, 
and suppose that N’teL’. Then the order of N’t in L’/L divides N (because 
N’Nte L)and divides n (because # L’/L = n), and so divides g.c.d.(N, n) = 1. 
Thus N’te L. 

Notice that the sum in (5.2) is finite, since any lattice L’ in the sum must 
be contained in 4#L = {#/|leL}, because each element of L’/L has order 
dividing n = #L’/L. Thus, each L’ in the sum corresponds to a subgroup of 
order n in 4L/L & (Z/nZ)?. 

Note that 7, = 1 = the identity map. 

Next, for any positive integer n prime to N we define another linear map 
Fey ee bY 


1 
Th n(x) = yet 1/n)L,t° (5.3) 


Note that ¢ has exact order N modulo +L, because g.c.d.(N, n) = 1. Again 
we are using the same letter ¢ to denote an element in C/L and the corre- 
sponding element in C/5L. 

It is easy to check the commutativity of the operators 


7 =F =T. .T.,% Teele = ToT. (5.4) 


ny,ny~* Ny,Nz nyny,myny Ny,N,*n,,n,? m*n,n* 


It is also true, but not quite so trivial to prove, that the 7,,’s commute with 
one another for different m’s. This will follow from the next proposition. 


Proposition 32. (a) If g.c.d.(m,n) = 1, then Ty, = TT, ; in particular, T,, 
and T,, commute. 

(b) If p is a prime dividing N, then T, = T,. 

(c) If p is a prime not dividing N, then for 1 > 2 


T,! = T,y-1T, — pT,!-2T), p- (5.5) 


ProoF. (a) In the sum (5.2) for T,,,, the L’ correspond to certain subgroups 
S’ of order mn in #iL/L, namely, those which have trivial intersection with 
the subgroup Zt < C/L. Since g.c.d.(m, n) = 1, it follows that any such S’ 
has a unique subgroup S” of order n; if L” > L is the lattice corresponding 
to S”, then S’/S” gives a subgroup of order m in wL”/L”. Both S” and 
S’/S” have nontrivial intersection with Zt. Conversely, given S” = L’/L c 
IL/L of order n and a subgroup S’ = L’/L* c AL’/L” of order m, where 
both subgroups have trivial intersection with Zt, we have a unique subgroup 
L’'/L < @aL/L of order mn with nontrivial intersection with Zt. This shows 
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that the modular points that occur in T,,,(€1,,) = mm €,,,, and in T,,(T7,(é,,1)) 
= 42XT,,(e,,,,) are the same. 

(b) By induction, it suffices to show that 7,1-17, = T, for /> 2. Let 
= 8t. Then T(z...) =p‘ Xe;,,, Where the summation is over all L’ > L 
such that L’/L < p"'L/L has order p' and does not contain t’. Notice that 
L’/L must be cyclic, since otherwise it would contain a (p, p)-subgroup of 
p 'L/L. There is only one such (p, p)-subgroup, namely $L/L, and t’e4L/L, 
since pt’ = Nte L. Once we know that L’/L must be cyclic, we can use the 
same argument as in part (a). Namely, for each L’ that occurs in the sum for 
T,!(é1,1) there is a unique cyclic subgroup of order p in L’/L; the correspond- 
ing lattice L” occurs in the sum for 7,(e;,,), and L’ is one of the lattices that 
occur in 7,1-1(e,.,). This shows the equality in part (b). 

(c) Since p} N, the condition about the order of ¢ in C/L’ is always fulfilled. 
We have 7,)-17,(ér,1) =p 'Z,.Z,-e,.,, where the first summation is over 
all lattices L” such that S” = L”/L has order p, and the second summation 
is over all L’ such that S’ = L’/L” has order p'!. On the other hand, T,(€1,1) 
=p 'Z,,e,,, where the summation is over all L’ such that L’/L has order 
p'. Clearly, every L’ in the inner sum for T,!-1T, is an L’ of the form in the 
sum for 7,1, and every L’ in the latter sum is an L’ of the form in the former 
sum. But we must count how many different pairs L”, L’ in the double sum 
lead to the same L’. First, if L’/L is cyclic, then there is only one possible 
L’. But if L’/L is not cyclic, ie., if L’/L > $L/L, then L” can be an arbitrary 
lattice such that L”/L has order p. Since there are p + | such lattices (for 
example, they are in one-to-one correspondence with the points on the 
projective line over the field of p elements), it follows that there are p extra 
times that e;,,, occurs in the double sum for T7,:-17,,. Thus, 


T(€1,1) = T,!-17,(€1,.) — P a oy Crit: 
L'>(1/p)L 
(L:(1/p)L]= pl 2 
But 
T,1-2T, a T, =p! 
pt 2T,, p(Cn,2) = ZT pl 2€ (1 )py,1 = P » a 
P [L':(1/p)L]= p! 2 

This concludes the proof of part (c). Qo 

If n = pj! --- pris the prime factorization of the positive integer n, then 


Proposition 32(a) says that 7, = Tha see Thx Then parts (b)—(c) show that 
each Tha, is a polynomial in T,, and T, Pj" It is easy to see from this and (5.4) 
that all of the 7,’s commute with each other. Thus, the operators T,,, (n a 
positive integer prime to NM) and T,, (m any positive integer) generate a 
commutative algebra # of linear maps from ¥ to Y; actually, H is gen- 
erated by the 7, , (p{N a prime) and the T, (p any prime). 

There is an elegant way to summarize the relations in Proposition 32 as 
formal power series identities, where the coefficients of the power series 
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are elements in #. First, for p|N, we can restate Proposition 32(b) as 


follows: 
1 


Bat 
Be Te 
ie., (27, X')(1 — T,X) =1 in #[LX]]. This follows from Proposition 
32(b) because, equating coefficients, we see that the coefficient of X' is 
T, — T,!-1T,. Similarly, for p/N, part (c) of Proposition 32 is equivalent 
to the identity 


p\N, (5.6) 


I 
b= Tr 


i.e., if we multiply both sides of (5.7) by 1 — T,X + pT, pX? and equate 
coefficients of powers of X, we see that (5.7) is equivalent to the equalities 


T,=1, T,—T,=0, T,t—Ty-1T)+pTy-2T,,, for 122. 


¥ TX! = PIN, (5.7) 
1=0 


To incorporate part (a) of Proposition 32, we introduce a new variable s 
by putting X = p* for each p in (5.6) or (5.7). We then take the product of 
(5.6) over p with p|N and (5.7) over all p with p/ N: 

ne 1 1 
Typ = = = =r 
es . irr Nite aor od ‘ 
But, by part (a) of the proposition, when we multiply together the sums on 
the left in this equality, we obtain 2 7,,n~*, where the sum is over all positive 
integers n. The proof is exactly like the proof of the Euler product for the 
Riemann zeta-function. We use the factorization n = pf! --- p?r, and the 
relation: 7,n°-* = (Tha, pi*") «++ (TyP, 7"). We hence conclude that 
2 1 1 
T,n °= = = aoe: 
2, lli=a lizre ce 2 

For d an integer prime to N, let [d]: & > & be the linear map defined 
on basis elements by [d]e,,, = ea. Note that dt has exact order N in C/L 
because g.c.d.(d, N) = 1. Also note that [d] depends only on d modulo N, 
i.e., we have an action of the group (Z/NZ)* on &. 

We now consider functions F on modular points and the corresponding 
functions f(z) on H. Again we suppose that we are in the case I’ = I, (NV). 
If T: & > F is a linear map given on basis elements by equations of the 
form T(e,,,) = 24,ép,, then we have a corresponding linear map (which we 
also denote 7) on the vector space of complex-valued functions on modular 
points: TF(L, t) = 2a,F(P,). For example, 


[d]F(L, t) = F(L, dt) (here g.c.d.(d, VN) = 1); 


(5.8) 


T,. FL) = wae(L ‘ (g.c.d.(n, N) = 1); (5.9) 


TFL, 1) =—Y FL’, 
> 
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where the last summation is over all modular points (L’, ¢) such that [L’: L] 
=n, as in (5.2). 


Proposition 33. Suppose that F(L, t) corresponds to a function f(z) on H 
which is in M,(T,(N)). Then (dF, T,,,,°, and T,,F also correspond to functions 
(denoted [d]f, T,,,f, and T,f) in Ss (N)). If fis a cusp form, then so are 
[d]f, T,,,f, and T,f. Thus, [d], T,,,, and T, may be regarded as linear maps 
on M,(T,(N)) or on S,(0,(N)). In this situation, let x be a Dirichlet character 
modulo N. Then fe M,N, x) if and only if [d|F = x(@)F, i.e., if and only if 


F(L, dt) = 7(d)F(L, t) for de(Z/NZ)*. (5.10) 


Proor. To show that [d]/ 7,,,fand T,,fare invariant under [y], for yeT',(N), 
by Proposition 31 it suffices to show that La) F(L, t), T,,,F(L, 0), and T,F(L, t) 
have weight k, ie. [d]FUL, A) = A*[d | F(L, t), T,,,F(L, At) = 
A*T, ,F(L, t), and T, FOL, At) = 2-*T,, F(L, 0); but this all follows trivially 
from the definitions. We next check the condition at the cusps. 

Note that if « = (4 5)e¢GL}(Q) and if f(z) corresponds to F(L, f), then 


Ale)|[et], = (det 99"(cz + a+ (te x) 


= (det a)? F (Lacrnerre Ht 


In particular, if «eT, then this equals F(L,, (cz + d)/N). Next, for each 
de(Z/NZ)*, choose a fixed o,€T such that o, = (‘47 9) mod N. (This is 
possible, because g.c.d.(d, N) = 1, and the mapT > SL,(Z/NZ) is surjective, 
by Problem 2 of §III.1.) We then have 


flak = F(te.2)=[dlF(t.. 2) = (ave) 


ie., [d]f=f|[o,],. Thus, for yy¢T we check (3.8) for [d]f as follows: 
[4] f |Lvolk =f |Louyo],, which has a q-expansion of the required type, i.e., 
[d] fsatisfies (3.8) if fdoes. Similarly, we find that T,,,f(z) =n 7FGL,, #) = 
n~?F(L,, x) = n* *[n] f(z), so this case has already been covered. 

We next consider the cusp condition for T,, f(z), which is a sum of functions 
of the form 4F(L’, x), where L’ is a lattice containing L, with index n. We 
take such an L’ and let (@,, w,) bea basis for L’. Since L, < L’ with index n, 
there is a matrix t with integral entries and determinant n such that (7) = tw 
(«@ denotes the column vector with entries w, and w,). We can choose a set 
T of such t (independently of z) such that 


T,f(Z) = : dX F(t i). 


ine Oy consider each F(L,,, 4), where (7) = (1). We find a yeT such that 
=7(6 %) with zero lower-left entry and a, b, d integers with ad =n 


160 III. Modular Forms 


(it is an easy exercise to see that this can be done). The lattice spanned by 
(2:) = 1 *(j) is the same as that spanned by (3!) = #(6 @)(i). Thus, 


2 


1 1 a 
F(t x) = vere x) = a‘F (Eero 2) 


= a'{a] f((az + b)/d). 


But if f(z) satisfies (3.8), then so does f((az + b)/d), and [a] also preserves 
(3.8), as shown above. This proves the cusp condition for 7, f- 

Finally, let y;¢T(N) be any element with lower-right entry d. As shown 
above, f(z)|[ya, = F(L., %) = [d] F(L., x). Thus, f|[y.], corresponds to 
[d]F, and so f|[yi], = x(@)/ if and only if [d]F = (d)F. This completes 
the proof of the proposition. Qo 


We saw before (Proposition 28) that a function fe M,(I,(N)) can be 
written as a sum of functions in M,(N, x) for different Dirichlet characters 
y. Thus, using the one-to-one correspondence in Proposition 31, we can 
write a modular form F(Z, ft) as a direct sum of F’s which satisfy (5.10) for 
various 7. 


Proposition 34. The operators T, and T,, , commute with [d], and preserve the 
space of F(L, t) of weight k which satisfy (5.10). If FL, t) has weight k and 
satisfies (5.10), then T,,,F = nk? y(n)F. 


PRoor. That the operators commute follows directly from the definitions. 
Next, if [d]F = y(d)F, it follows that [d]7,F = T,[d]F = x(@)T,F and 
[4] Ti nF = Ty nLd|F = x(@)T,,,F. This is just the simple fact from linear 
algebra that the eigenspace for an operator [d] with a given eigenvalue is 
preserved under any operator which commutes with [d]. Finally, if F(Z, 1) 
satisfies (5.10), then T7,,,F(L,t) = n ?FGL,t) = n7F(L,nt) = 
nk? Tn] F(L, =) = n*-7 y(n) F(L, 0). g 


If we translate the action of T,, T,,, and [d] from functions F(L, t) to 
functions f(z) on H, then Proposition 34 becomes 


Proposition 35. 7,, and T,,,, preserve M,(N, x), and also S,(N, x). For 
feEM,\(N, x) the action of T,,, is given by T,,,f =n? x(n)f. 


Proposition 36. The operators T, on M,(N, x) satisfy the formal power series 
identity 
y Get= [1d - Ret tape. (5.11) 
n=1 allp 
PRroor. We simply use (5.8) and observe that for p}{N we have 7, ,f= 
p*?x(p)f, while for p|N the term on the right in (5.11) becomes (1 — 
T,p *)* because x(p) = 0. o 
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As a special case of these propositions, suppose we take x to be the trivial 
character of (Z/NZ)*. Then M,(N, 7) = M,(T(¥)), and the modular forms 
SEM,(N, x) correspond to F(L, t) on which [d] acts trivially, i.e., F(L, 1) = 
F(L, dt) for all de(Z/NZ)*. Such F(L, ¢) are in one-to-one correspondence 
with functions F(L, S), where S is a cyclic subgroup of order N in C/L. 
Namely, choose ¢ to be any generator of S, and set F(L, S) = F(L, t). The 
fact that F(L, t) = F(L, dt) means that it makes no difference which generator 
of S is chosen. Conversely, given F(L, S), define F(L, t) = F(L, S,), where 
S, = Zt is the subgroup of C/L generated by t. So we have just verified that 
functions of modular points in the sense of case (iii) at the beginning of this 
section correspond to modular forms for [p(V). 

We now examine the effect of the Hecke operator T,, on the g-expansion 
at oo of a modular form f(z)—_M,(N, x). That is, if we write f(z) = Xa,q" 
and T,, f(z) = 26,q", q = e?"'*, we want to express 5, in terms of the a,. 

We first introduce some notation. If feC[[q¢]], f= 24,9", we define 


Vat = Langs Unf = Yana, (5.12) 


where the latter summation is only over n divisible by m. Note that U,; = 
V, = identity, and U,,0 V,, is the identity, while V,, 0 U,, is the map on power 
series which deletes all terms with n not divisible by m. Suppose that f(z) = 
Lag", q = e?", converges for ze H. Then we clearly have: 


Vafle) =floma); Unf) =F", s(2#4). (5.13) 


Proposition 37. Let f(z) = Epo aq", q =", fe M\(N, 1), and let T,f(2) = 
rr 0 5,9". Then 
By = Ayn + X(P)P nip, (5.14) 


where we take x(p) = 0 if p| N and we take a 
In other words, 


= 0 ifn is not divisible by p. 


n/p 
T, = U, + x(p)p*"'V, on M,(N, 2. (5.15) 


Proor. We have 7, f(z) = 72, F(L’, x), where F is the function on modular 
points which corresponds to f and the sum is over all lattices L’ containing 
L, with index p such that } has order N modulo L’. Such L’ are contained in 
the lattice }L, generated by 3 and $, and the lattices of index p are in one-to- 
one correspondence with the projective line P} over the field of p elements 
F, = Z/pZ. Namely, the point in P?, with homogeneous coordinates (a,, ay) 
corresponds to the lattice generated by L, and (a,z + a,)/p. Thus, there are 
p +1 possible L’ corresponding to (1, /) for j= 0, 1, ..., p — 1 and (0, 1). 
If p} N, then all p + 1 of these lattices L’ are included; if p|N, then the last 
lattice (generated by L, and 4) must be omitted, since 4 has order ¥ in that 
case. Note that the lattice cenetated by L, and (z + j)/p is Lin+ Dp. Thus, if 

p|N we have T, f(z) =4222) FLysijyps b) = dD S EY = Uf). If 
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pIN, then we have the same sum plus one additional term corresponding 
to the lattice generated by L, and 5; this lattice is }L,,. Thus, in that case 


T, fle) = U, fle) + nie x) = U, f(z) + PoP (Ly 2) 


= pt (2) +p" y(p)F ee x) = U,f(Z) + p**y(p)f(p2), 


which is (5.15). The expression (5.14) for the 5, follows directly from (5.15) 
if we use the expressions (5.12) for the operators V, and U,,. oO 


As a consequence of Proposition 37 we have the factorization 
1 — 7,X + y(p)p**X? = (1 — U,X)(1 — x(p)p*""V,X), (5.16) 


where both sides are regarded as polynomials in the variable X whose 
coefficients are in the algebra of operators on the subspace of C[[¢]] 
formed by the g-expansions of elements f(z) ¢ M,(N, x). To see (5.16), note 
that the equality of coefficients of X is precisely (5.15), while the coefficients 
of X? agree because U,o V, = 1. 


Proposition 38. We have the following formal identity : 


3 T,n § -( y(nynk! rar)(S un), (5.17) 


n=1 n=1 


or, equivalently, 
T, = Y x(d)d* V0 ya: (5.18) 


d|n 


ProoF. By (5.11) and (5.16) we find that the left side of (5.17) is equal to 
[1 — U,p-)d — x(p)p* Vp). 
Pp 


Since the U, and V, do not commute, we must be careful about the order of 
the factors. Moving the inverse operation inside the outer parentheses, we 
reverse the order, obtaining 


Ia — x(p)p* Vp) "(1 — U,p*)"). 


Note that U, and V,, do commute for p, # p2, as follows immediately from 
(5.12). This ‘enables us to move all of the (1 — U,p~*)"! terms to the right 
past any (1 — x(p2)p5'V, Pr) ‘ term for p, # p. This gives us separate 
products with the V,’s ond with the U,’s: 
1 1 
Ty — x(p)p*"'V,p aij ee) ae 

We now expand each term in a geometric series and use the fact that V,,,, = 
V,,0V, and U,,, = U,,0 U,,. The result is (5.17). o 
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Proposition 39. Under the conditions of Proposition 37, if T,,f(Z) = UR-9 5,9", 
then 
b= YX) Ananya (5.19) 


d|g.c.d.(m,n) 


Proor. According to (5.18), we have 
Ls >» ang” = x x(d)d*"'V,0 Ona > ang” 
d|m n=0 


n=0 
= Px@)at* ag? 
d|m m/d|n 
If we set r = d?n/m, the inner sum becomes 2 4,m/q2q" with the sum taken 
over all r divisible by d. Replacing r by n and gathering together coefficients 
of q", we obtain the expression (5.19) for the m-th coefficient. Oo 


Most of the most important examples of modular forms turn out to be 
eigenvectors (‘‘eigenforms’’) for the action of all of the 7,, on the given space 
of modular forms. If fe M,(N, y) is such an eigenform, then we can conclude 
a lot of information about its g-expansion coefficients. 


Proposition 40. Suppose that f(z)—eM,(N, x) is an eigenform for all of the 
operators T,, with eigenvalues 4,,, m= 1,2, ...: In f=Apf. Let Gy, be the 
q-expansion coefficients: f(z) = U5 a,g". Then ay, = A,_,a, for m=1, 2, 
.... In addition, a, #0 unless k =0 and f is a constant function. Finally, if 
ay # 0, then 2,, is given by the formula 

A= yaa. (5.20) 

d|m 

Proor. Using (5.19) with w = 1, we find that the coefficient of the first power 
of gin T,, fis a,. If T,,f=4,Jf, then this coefficient is also equal to A,,d,. 
This proves the first assertion. If we had a, = 0, then it would follow that 
all a,, = 0, and f would be a constant. Finally, suppose that a, 4 0. If we 
compare the constant terms in 7,,f=4,,f and use (5.19) with n= 0, we 
obtain: 1,49 = bo = Lam y(d)d*a,. Dividing by a, gives the formula for 
Am im] 


If fis an eigenform as in Proposition 40 (with k ¥ 0), then we can multiply 
it by a suitable constant to get the coefficient of g equal to 1: a, = 1. Such 
an eigenform is called “‘normalized”’. In that case, Proposition 40 tells us 
that a,, = /,, is simply the eigenvalue of 7,,. If we then apply the operator 
identity (5.11) to the eigenform /, we obtain identities for the q-expansion 
coefficients of #/ Namely, applying both sides of (5.11) to a normalized 
eigenform fe M,(N, x), we have: 


oO 


a,n-*=[](1 —a,p* + x(p)p*"-*5)?. 
=1 P 


n 
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ExampLes. 1. Let N = 1, and let k > 4 be an even integer. Suppose that 
f= a,q"—M,(1) is an eigenform for all of the 7,,, and suppose that ay # 0 
and f is normalized (i.e., a, = 1). Then Proposition 40 says that f= dy) + 
2o,_,(n)q". But recall the Eisenstein series FE, = 1 — LX o,_1(n)q" (see 
(2.11)). Thus, f and soe are two elements of M,(I) which differ by a 
constant. Since there are no nonzero constants in M,(I), they must be equal, 
1.€.,d) = —B,/2k. Therefore, up to a constant factor, any weight k eigenform 
for T which is not a cusp form (i.e., dg # 0) must be £,. Conversely, it is 
not hard to show that £;, is actually an eigenform for all of the 7,,. This can 
be done using the g-expansion and (5.14) (it suffices to show that it is an 
eigenform for the 7,, since any 7, is a polynomial in the operators 7, for p 
prime). Another method is to use the original definition of 7,, on modular 
points, applied to G,(L) = Zoz;<,/*. 

2. Let N=1, k=12. Since S,,(1) is one-dimensional, spanned by 
(2x) 1*A(z) = ZZ, t(n)q" (see Propositions 9(d) and 15), and since the 7, 
preserve S,(I), it follows that f= Xt(n)q" is an eigenform for T,,. It is 
normalized, since t(1) = 1. Then Proposition 40 says that T,, f= t(m)/f for 
all m. Thus, the relation (5.11) applied to f gives 


= 2 1 
t(n)n = —s —2s° 
Pp Hmerrese sige ce 


This Euler product for the Mellin transform of f= 2X t(n)q" is equivalent to 
the sequence of identities (see Proposition 32): 


(5.21) 


t(mn) = t(m)t(n) for m, n relatively prime; 


t(p') = t(p!")t(p) — pt*t(p"”). 

Ramanujan conjectured that in the denominator of (5.21) the quadratic 
1 — t(p)X + p'!X? (where X¥ = p~‘) has complex conjugate reciprocal roots 
a, and %,; equivalently, 1 — t(p)X + p''X? =(1—4,X)(1 —&,X), ie., 
t(p) = 4, + &, with |x,| = p'’”. From this it is easy to conclude that |t(p)| < 
2p'*?, and, more generally, |t(m)| < o9(n)n''? (see the proof of Proposition 
13 in §II.6). The Ramanujan conjecture and its generalization, the 
Ramanujan-—Petersson conjecture for cusp forms which are eigenforms for 
the Hecke operators, were proved by Deligne as a consequence of the Weil 
conjectures (see [Katz 1976a]). 

The Euler product (5.21) is reminiscent of the Hasse—Weil L-series for 
elliptic curves. For more information on such connections, see [Shimura 
1971]. 

3. Let N= 4, x(n) = |) = (—1)""?” for n odd. We saw that M,(N, x) 
is one-dimensional and is spanned by ©?. If we apply Proposition 40 to 
1@7=1444+---+4,q"+ +++, we find that /,, = = (-)), where the sum 
is over odd d dividing m. For example, 


—] 0 ifp =3(mod 4); 
aoe | = 5.22 
Ap +(=4) : if p = 1 (mod 4). Of) 
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Since /,, is one-fourth the number of ways m can be written as a sum of two 
squares, we have recovered the well-known fact that p can be so expressed 
in eight ways if p = 1 (mod 4) and in no way if p = 3 (mod 4). That fact is 
usually proved using factorization in the Gaussian integers Z[i]. The 
Gaussian field Q(i) has discriminant —4, and corresponds to the character 
x of Z having conductor 4: y(n) = (2). Let 1 denote the trivial character of 
(Z/4Z)*, and let p denote the two-dimensional representation (3 4) of 
(Z/4Z)*. Then we can write (5.22) in the form 4, = Tr p(p). This turns out 
to be a very special case of a general fact: a normalized weight-one eigenform 
in M,(N, x) has its g-expansion coefficients determined by the trace of a 
certain two-dimensional representation of a certain galois group. For more 
information about this, see [Deligne and Serre 1974]. 


Another approach to Hecke operators. Let T, and I’, be two subgroups of 
some group G. 


Definition. [', and I’, are said to be ‘“commensurable”’ if their intersection 
has finite index in each group: [T,:T, aT,] < 0, [1):1,0T,] <0. 


Basic ExampLe. Let I’ be a congruence subgroup of T = SL,(Z), and let 
ae€G=GLj(Q). Then I’ and « 1I’a are commensurable. To see this, 
suppose I’ > T'(NV). By Lemma | in the proof of Proposition 17, we have 
aa !la«>T(ND) and I’nal’a! >T(ND) for some D. Let Fr” = 
aa!’ Then T” >F(ND), oF’a ! >T(ND). Thus, [T’:T’] < 
[C: P(ND)] < 00, and also [a 'T’a: 0] =[I’: oF’a"'] < (1: P(ND)] 
< ©. 


Definition. If T’,, , < G and «eG, then the double coset I, aI, is the set of 
all elements of G of the form y, ay, with y,ET,, y,¢T,. Notice that To. , 
contains the right coset I’, «, and in general is a union of right cosets of the 
form I, ay. 


Proposition 41. Let I’ < G be any subgroup of a group, and let «€G be any 
element such that TY’ and «1T’« are commensurable. Let T’ =T’ na Ta. 
Let [1’: T’] =d, and write T’ = (§_, yj. Then Vol’ = (4_, M’ayy is a 
disjoint union of d right cosets. Conversely, if V’aI’ = \_)4_, Vay; is a disjoint 
union of d right cosets, then’ = \ )4_, Ty. 


Proor. Given an element },«y, with y,, y,€I’, we can write y, = yy; with 
y’el” for some j. Since y” ea 1I’a, we can write y’ =a 'y’a, so that 
VO = yiala* yay = (Hy oyy € T’ay;. We must show that the right cosets 
I’ay; are distinct for different j. Suppose y,a); = y207,. Then yjy) = 
amt yy ty,aea'T’a. Since yyy eT’, it follows that y/y,-' eT”, ie., eT’, 
and so 7 =k. The converse assertion is also routine, so we shall omit the 
details. oO 
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We shall define Hecke operators in terms of double cosets for a large class 
of congruence subgroups of I’, a class which includes T’,(N), [p9(V) and 
I'(N) as special cases. Then we shall show that in the case of I',(4) this 
definition coincides with our earlier definition of the operators 7, acting on 
MT (N)). 

Let S* be a nonzero additive subgroup of the integers, i.e., S* = MZ for 
some positive integer M. Let S* be a subgroup of (Z/NZ)*. We shall also 
use S* to denote the subset of Z whose image modulo N is in S*. (If N = 1, 
then we agree to take S* = Z.) For example, if S“ = {1}, then we also use 
S* to denote 1+ NZ c Z. Let n be any positive integer. We define 
A'(N, S*, S*) = 


qq {integer matrices (? j)| N|c, aeS*, be S*, det (2 §) =n}. 


(5.23) 


If N=1 and S* = S* =Z, then A” is simply the set of all 2 x 2-matrices 
with determinant n. It is easy to check that 


A™(N, S*, S*)- A", S*, St) < A™(N, S*, S*), (5.24) 


and that A1(N, S*, S*) is a group. A1(N, S*, S*) is clearly a congruence 
subgroup of I, since it contains T(N’), where N’ is the least common 
multiple of M and N (recall S* = MZ). Here are our familiar examples: 


T,(N) =A?(N, {1},Z);  — To(N) = At, (Z/NZ)*, Z); 
T(N) = A1(N, {1}, NZ). 


Definition. Let I’ be a congruence subgroup of I, and let «¢GL}(Q). Let 
l=’ ona'Ta, and letd = [1:1], 1 = (9-07. Let f(z) be a func- 
tion on H which is invariant under [y], for yeT’. Then 


d 
SOW hg ¥ SOM). (5.25) 


Proposition 42. f(z)|[I'’oI"’], does not change if « is replaced by any other 
representative «' of the same double coset: T’a’T’ =Y’al’. Nor does it depend 
on the choice of representatives y; of T’ modulo 1”. If feM,(I"’), then 
f\Wol’],€ M,(T"). 


PROOF. We first prove the second assertion, that is, that (5.25) is unchanged 
if y; is replaced by y;/’y;, where y/ eT”. Since” < a” *T’a, we have y’ = a "F,0 
for some #j€ 1”. Then f |[orf’y x =/ [Lajas] =F Lary e» because f |[5], =S. 
Next, we observe that, by Proposition 41, the sum on the right in (5.25) can 
be written © f(z)|[a;],, where the «, are any elements such that Tal’ = 
| )Y’a;. It is then immediate that the definition depends only on the double 
coset I’’aI” and not on the choice of representative «. Finally, suppose that 
feM(V). If yet’, then (for Wllvk =2/|Lorj7hk =/1 .. 
since right multiplication by y just rearranges the right cosets T’ay;. If f 
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satisfies the cusp conditions, then so does f|[wy;], for each j, by Lemma 2 
in the proof of Proposition 17. This completes the proof of Proposition 42. 
Oo 


Definition. Let I’ = A'(N, S“, S*), and let n be a positive integer. Let 
feM,(T’). Then 


LS = ner ys SLE ok Is (5.26) 


where the sum is over all double cosets of TP’ in A"(N, S*, S*). 
By Proposition 42, we have T,, fe M,(T’). 
Equivalently, we can define 


Ti fSnte* y Fla les (5.27) 
where I’«; runs through the right cosets of I’ in A"(N, S*, S*). 


Proposition 43. In the case T’ =1,(N), the definition (5.26) agrees with our 
earlier definition of the Hecke operators T,,. 


Proor. Let A" = A"(N, {1}, Z). For each ae(Z/NZ)* we fix o,€T such that 
o, = (*( °) mod N. oO 


Lemma. 


a b 

A*= |) IT, We, ( ) (5.28) 
disjoint 0 d 

where the disjoint union is taken over all positive a dividing n and prime to N, 

and for each such a we set d= n/a and takeb=0,1,...,d—1. 


PROOF oF LEMMA. The terms on the right in (5.28) are clearly contained in 
A". Suppose the union were not disjoint. Then for some a’, b’, d’ = n/a’ we 
would have y,0,(6 >) =720,(4 %), and so I would contain the matrix 
(6 a a) 1 =(%S aly); then a’ =a, d’ =d, and so (8 5) =(8/)(4 5) for 
some j, 1.e., 6 = b’ + jd. If 0 < b, b’ < d, this means that b = b’. 

To prove the lemma, it remains to show that any « = (7, 5)eA" is in one 
of the terms on the right in (5.28). Choose g, A relatively prime so that 
ga’ + hc’ = 0, and complete the row g, h to a matrix y = (¢ fel. Then ya 
has determinant n and is of the form (% *). Replacing y by +(} /)y if neces- 
sary, without loss of generality we may suppose that ya =: (4 °) with a> 0, 
ad =n, 0 <b <d. Then, considered modulo N, the equality « = y~1(4 ° 
gives (6 *) = (0 *)(G 2), ie., y’ is of the form ,0, for some y, €T,(N). 
Thus, « = y,0,(6 {)€T,(N)a,(6 °), as desired. This completes the proof of 
the lemma. oO 


We now prove the proposition. Let T7*” be the linear map on M,(T, (N)) 
defined by (5.26) (or (5.27)), and let T!4 be the earlier definition. Since 
MT, (N)) = ©,M,(N, 7), it suffices to show that 72°” = Te'4 on M,(N, x). 
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Let fe M,(N, x). By the lemma, we have 
Tre f= nO) f [old de- 


Since fe M,(N, x) we have f|[o,], = x(a)f. Also, for each a > 0 and d with 
ad = n we have 


LT PAILG Dk = > nkl2d- "(ee + *) 


= n'Pd*V,odU,f(2), 
by (5.13). Thus, 
| Aas Bee nbkl2)-1 Y xan? d'*V,0 Ui f. 
aln 
(The sum is over a > 0 dividing 1 and prime to N; but the term x(a) ensures 
that the terms with g.c.d.(a, N) > 1 will drop out.) That is, 


Petes SY (Ga Vou gate 


aln 


by (5.18). This proves Proposition 43. Oo 


In many situations, the definition of 7, in terms of double cosets is more 
convenient than the definition in terms of modular points. For example, we 
shall use this definition below to show that 7, is a Hermitian operator with 
respect to the Petersson scalar product. Moreover, the double coset approach 
can be generalized to situations where no good interpretation in terms of 
modular points is known. We shall see an example of this in the next chapter, 
where we shall define Hecke operators on forms of half-integral weight. For 
a more detailed treatment of the double coset approach in a more general 
context, see [Shimura 1971]. 


The Petersson scalar product. Suppose we have an integral over some region 
in the upper half-plane and make the change of variable z->az = “4%, 
where « = (¢ 5)€GL}(Q). If we have a term dxdy/y” in the integrand, this 


does not change under such a change of variable. To see this, we compute: 


daz deta | Imaz_ deta 
dz (cz +d)?’ Imz |ez+d|?" 


(5.29) 


In general, if we make a differentiable change of complex variable z, = u(z), 
(see Fig. III.5). 
Thus, interpreting the first equality in (5.29) in terms of the real variables 
x = Rez and y=Im_z, we see that an element of area near z is expanded 
by a factor |daz/dz|? = (det «)?/|cz + d|*. Thus, dxdy/y? is invariant under 
the change of variables, by (5.29). 


Proposition 44, Let T’ <T be a congruence subgroup, let F’ < H be any 
fundamental domain for TY’, and define 
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Figure IJI.5 


, dxd 
wl) = | = (5.30) 
" 


Then 


(a) The integral (5.30) converges and is independent of the choice of F’. 


(b) (1: T’] = n0)/u). 
(c) IfseGLz(Q) and aa <7, then (1: PF] =[T: a 'P’a]. 


Proor. First let [[:I’] =d, let T = (4,4,0’, and take F’ = Var r. 
Notice that the integral for u(I) converges, because 


Gay <|- I. y *dydx ie 
F y 1/2 J V3/2 V3 


If for each j we make the change of variables z++a,z, we find that 
Sa >i dxdy/y? = |, dxdy/y”; hence, for our choice of F’ we find that the 
integral i in (5.30) converges to du(I). Suppose we chose a different funda- 
mental domain F, for I’. Then we divide F, into regions R for which there 
exists EI’ with oR c F’, and again we use the invariance of dxdy/y? under 
z++az to show that the integral over Rc F, and the integral over the 
corresponding region «R in F’ are equal. Finally, to show part (c), we note 
that «”1F’ is a fundamental domain for «!T’a. Since 


| ae = | dxdy 
alr’ y? F’ y? , 


it follows that u(a~'T’w) = w(I’), and so part (c) follows from part (b). 


Now suppose that f(z) and g(z) are two functions in M,(I’). We consider 
— function f(z)g(z)y*, where the bar denotes complex conjugation and 

=Imz. If we replace the variable z by az for «€¢GL3(Q), we obtain: 
Aing@ary'idet a/|cz + d|?)*, by (5.29). But this is just (/(2)|[o],) 
(g(z)|[~],)y*. Thus, the effect of the change of variable is to replace f by 
Ff \(#], and g by g|[],. 


Definition. Let I’ < F be a congruence subgroup, let F’ be a fundamental 
domain for I’, and let f, g¢ M,(I’), with at least one of the two functions f, 


170 III. Modular Forms 


g acusp form. Then we define 


oa ==> (5.31) 


It is immediate from this definition that < f, g> is linear in fand antilinear 
in g (i.e., <f, cg> = C< f, g>), it is antisymmetric (i.e., <g, f> = «fi g>), and 
also <f, f> > 0 for f# 0. These are the properties one needs in order to 
have a hermitian scalar product. We shall return to this later. 


Proposition 45. The integral in (5.31) is absolutely convergent, and does not 
depend on the choice of F’. If T” is another congruence subgroup such that 
fgE€M,(0"), then the definition of < f, g> is independent of whether f, g are 
considered in M,(1’) or in M,(TU”). 


Remark. The term 1/[I':T’] in (5.31) is needed in order to have the 
second assertion in the proposition. The requirement that for g be a cusp 
form is needed to get convergence of the integral, as we shall see. 


Proor. First take F’ = |) aj 1F, where T = () aI”. In each region «; 1F make 
the change of variables which replaces z by a; *z, thereby transforming the 
integral into 


y aN fellas oT TS 


Since f, ge MAU’), we can write f |[a; “Tk = zr0 Un INs g\Loy "I. id Zn=0 b,4N> 
with either a, = 0 or b, =0, since f or g is a cusp form. Because |gy| = 
\e27/N| — e-2n¥N and y > *3 in F, and because a, and b, have only polyno- 
mial growth (see Problem 19 in §3), it is not hard to see that the integral is 
absolutely convergent. Next, if F, is another fundamental domain, we 
proceed as in the proof of Proposition 44, comparing fx f(z)g(z)y* 7dxdy 
with |,x/(z)g(z)y* 2dxdy, where R is a subregion of F, and «R, with weI’, 
is the corresponding subregion of F’. Making the change of variable 
zt»az and using the fact that f|[«], =f, g|[~], =g, we find that the two 
integrals are equal. 

Finally, suppose f, geM,(I”). First suppose that TP” <I’. Writing 
Tl’ =(|),6,0’, F” =|) 6;1F’, we have 


,axd . 


[T: mri, 
1 

ius cal iF Fa4 

But all of the summands on the right are equal, since f|[0;"], =f, g|Lor - = 

g; and there are [T’: ’”] values of /. We thus obtain the right side of (5.31). 

If” ¢ VT’, we simply set 1” =I” NI’, and show that (5.31) and (5.31) 


with I” in place of I’ are each equal to (5.31) with I” in place of I’. This 
completes the proof of Proposition 45. oO 


k a 


Z| AO @llr 
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Proposition 46. Let f, g¢M,(I’) with f or g a cusp form. Let we GL} (Q). Then 
<h9> =<S\[ole lle. (5.32) 


Proor. Let TP” =I’ nal’a!. Then f, geM,(T”), and f|[«],, g|[o],€ 
M,(a7!T’a«) by Proposition 17(a). Here « 'T’a =a 'T’a nT’. Let F” bea 
fundamental domain for I”, and take a !F” as a fundamental domain for 
a 1P”’a. Then the right side of (5.32) is 
,dxdy 

y? 


Taree | _, Follkg@ lll 


a 1 dxdy 

[Ts ot P’«] yr 

Since [T': « 1I°’a] =[I°: T’’] by Proposition 44(c), we obtain the left side 
of (5.32). Oo 


| fog@y* 


Now for «€ GL3 (Q) we note that « can be multiplied by a positive scalar 
without affecting [«],. So without loss of generality we shall assume in 
what follows that « = (¢ 5) has integer entries. Let D = ad — bc = det a, and 
set ao = Da 1 =(4 ~). Then [a1], = [o’],. 


Proposition 47. With f, g, « as in Proposition 46, 


Cf |Loe > = <f lle]. (5.33) 
In addition, < f \[«],, g> depends only on the double coset of « modulo I’. 


Proor. If in (5.32) we replace g by g|[«~*], and replace I’ by I’ nal’a™', 
then Proposition 46 gives (5.33). Now suppose we replace « by y, ay, in 
<f|Lol.. g>. We obtain Cf Dyer) Q= «fll: g\lyz" > = <f lle 
g>, because f is invariant under [y, ], and g is invariant under [y;"],, since 
V1.9 EI". oO 


Proposition 48. Let T’=1,(N), A"=A"(N, {1}, Z) (see (5.23)). Let f, 
géEM,(T’) with f or g a cusp form. For each de(Z/NZ)*, fix some o,€T such 
that o, = (‘4 $) mod N. Let n be a positive integer prime to N. Then CT,.f, g> 
=f |Lon]e, Tag). Inparticular, iffe M,(N, 7), then <T,f, g> = xln)< f, Tag 
for n prime to N. 


Proor. If «=(2 5)eA" then «=() >) mod N, a =(% 4°) mod N, and 
o, 0 =(4 ~*'") mod N. Thus, o,«’ € A". By definition, 


Tf = nh Y. f(a Ie 


where the sum is over the distinct double cosets of I’ = A? in A”. Let d, 
denote the number of right cosets in aI’. Then d, = [I’: T’ na 11a] by 
Proposition 41. By (5.26), (5.25) and the second assertion in Proposition 47, 
we have 
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CTS g> i; et 4h |[oh. g>> 


where the sum takes one « from each double coset. The map Tal’ 
I'’(¢,«’)I’ permutes the double cosets. Namely, if we had a,a, and o,3 in 
the same double coset y, 0,0; 7, = 0,03, then taking inverses and multiplying 
by 7 would give 


=I =, Sh. -1 
Y2 %1O, Py = 429, °, 


i.€., &, = 72%,(0, 'y,0,). Since o,1I,(N)o, =1T,(N), we have “,ET aT’, 
and so «, and «, are in the same double coset. Next, we observe that FO 
d,, because 


I’ a (6,0) 11 'o,0 =T’ nao'T’o,07 =’ nal’a7! 
=a(atWaanrya}, 
and so 
d= (WV na Va] = (2 PA (6,0) V0] = dopas 
by Proposition 44(c). Thus, 


TL 9» = OY dana I \COn® Tas > = 2 Vf [Lone 9|Lo]e> 


by Proposition 47. This equals <f|[o,],, T,g>, as desired. Finally, if 
feM,(N, x), then f |[o,] = x(n)f, so we obtain the final equality. Q 


A basis of eigenforms. We first remark that for any congruence subgroup 
I’ and any integer k, the C-vector space M,(I’) is finite dimensional. 
One way to show this would be to take a fundamental domain F’ for 
I’ of the form F’ = |) a;'F and integrate f’(z)/f(z) around the boundary 
for nonzero fe M,(I’), as in the proof of Proposition 8 of §III.2, thereby 
obtaining a bound on the total number of zeros of fin a fundamental domain. 
But if /,(I’) were infinite dimensional, by taking suitable linear combina- 
tions one could obtain nonzero fe M,(I’) which vanish at any given finite 
set of points in F’. Alternately, one could prove finite dimensionality, and 
even obtain formulas for the dimension, using the Riemann—Roch theorem 
(see [Shimura 1971, §2.6]). 

The Petersson scalar product (5.31) gives a hermitian scalar product on 
the finite dimensional C-vector space S,(I’’). That is, < f; g> is linear in fand 
antilinear in g, it is antisymmetric, and < f, f> > 0 for f ¥ 0, as we remarked 
when we gave the definition (5.31). 


Proposition 50. Let n be a positive integer prime to N, and let y be a Dirichlet 
character modulo N. Let c, be either square root of %(n). Then the operator 
c, 1, on S,(N, x) is hermitian, i.e., (c,T,f, 9> = <f, CnTng>- 


PROOF. (¢,T,f, 9> = CnXTrf 9> = CnXO)< SA Trg> = Cnn <S, Thg> = 
Gn< Sf, Thg> = <f; CnTyg>, as claimed. g 
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We saw before that eigenforms for the 7, have nice properties: the 
coefficients can be expressed in terms of the eigenvalues for the 7,, and the 
corresponding Dirichlet series have Euler products. Because of Proposition 
50, it is possible to find a basis of such forms. 


Proposition 51. There exists a basis of the C-vector space S,(N, x) whose 
elements are eigenforms for all of the T,, for which g.c.d.(n, N) = 1. 


ProoF. For any fixed 7, with g.c.d.(n, N) = 1 and any subspace S < S,(N, y) 
which is preserved by 7,,, there exists a basis of S consisting of eigenforms 
of T,. To see this, we apply the following basic fact from linear algebra (see 
[Lang 1984, §XIV.12]) to the hermitian operator c,,T,,: Given any hermitian 
operator T on a finite dimensional C-vector space S, there exists a basis of 
S consisting of eigenvectors for T. We further note that any eigenspace for 
T,, is preserved by all 7,,, as follows from the fact that 7, and 7,, commute: 
if 7, f= 4,f, then 7,(7,, f) = T, Tf = 4,T,f. (This remark does not require 
n orn’ to be prime to N.) Thus, to prove the proposition, we list the T, for 
n prime to N (actually, it suffices to work with the J, for primes p} N, since 
any eigenform for them is an eigenform for the 7, with n prime to N). We 
write S,(NV, x) as a direct sum of eigenspaces S for the first 7, in the list. 
Then we write each S as a direct sum of eigenspaces for the next 7,,; then we 
write each one of those spaces (which is an eigenspace for the first two T,,’s) 
as a sum of eigenspaces for the third 7; and so on. Because S,(N, y) is 
finite dimensional, after finitely many steps this process must stop giving us 
any new smaller spaces. At that point S,(N, y) is expressed as a direct sum 
of subspaces on each of which the 7, for m prime to N act as a scalar. Any 
basis consisting of forms in these subspaces will satisfy the requirements of 
the proposition. oO 


Proposition 51 does not quite give us what we want, because of the 
restriction that n be prime to N. In order to have an Euler product (as, for 
example, in (5.21)), we would also want our basis forms to be eigenforms 
for T, for p| N. One way we could ensure this is if we found that the eigen- 
spaces for all of the 7, with m prime to N are each one-dimensional, because 
we know that the 7, for p|N commute with the 7, and so preserve each 
eigenspace. Such an assertion is called ‘‘multiplicity one”, i.e., each set of 
eigenvalues for the 7,, with n prime to N corresponds to only one eigenform 
in the basis that was constructed in Proposition 51. Multiplicity one does 
not hold in general; however, it does hold if we restrict our attention to 
forms which do not come from lower level. We now explain what this means. 

If d,d, = Nand fe M,(T,(d,)), then we also have fe M,(I',(N)) and also 
g(z) =S(d,z)€ M,(T,(N)) (see Proposition 17). The subspace of S,(I,(V)) 
spanned by the forms obtained in these two ways from forms fe S, (I, (d)) 
for proper divisors d| N is called the space of “old forms”. It is not hard to 
show that all Hecke operators preserve this space. The orthogonal comple- 
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ment to the space of old forms with respect to the Petersson scalar product 
is called the space of “‘new forms”’. In other words, a form fe S,(T,(N)) is a 
new form if and only if ¢f,g>=0 and <f, g|[@ °],> =0 for every ge 
S, (1, (d,)), where d,d, = N, 1 <d, < N. It can then be shown that the space 
of new forms has multiplicity one; thus, Proposition 51 holds for the space 
of new forms without the condition that n be prime to N. For details, see 
Chapter VIII of [Lang 1976]. 

Another question not answered by Proposition 51 is the existence of a 
basis of eigenforms for all of M,(T,(V)). Here we cannot consider < , > as 
a scalar product on M,(T,(N)), because < f, g> makes sense only if for g 
is a cusp form. Instead, one can use the explicit construction of Eisenstein 
series in $11.3 above, directly constructing eigenforms. Then M,(I,(V)) can 
be written as the orthogonal direct sum of S,(I,(V)) and a space spanned by 
Eisenstein series which are eigenforms. (In fact, an intrinsic definition of an 
Eisenstein series, which generalizes to many other situations, is: a modular 
form which is orthogonal to all cusp forms.) The dimension of the space of 
Eisenstein series turns out to be the number r of regular cusps of I’, (4). 
Roughly speaking, in order for a modular form to be a cusp form it must 
satisfy r conditions (vanishing of the constant term in the gy-expansion), one 
at each regular cusp; and so S,(I',(V)) has codimension r in M,(1,(4V)). For 
more information on Eisenstein series, see, for example, [Gunning 1962]. 


PROBLEMS 
1. Prove Proposition 31 in cases (iii)—(iv), i.e., F’ = Ty(V), PU). 


2. (a) Let ay =(2 ~4). We saw in Problem 15 of §III.3 that [ay ], preserves M,(Tp(N)). 
Prove that the Hecke operator 7;, commutes with a, for n prime to N. 

(b) Let F=Z,oa40,(")q", O* = X,,4,q"€ M(T(4)) (where a, equals the number 
of ways n can be written as a sum of four squares). Write the matrix of 7, in the 
basis ©*, F, and find a basis of normalized eigenforms for 7}. 

(c) Show that 7, does not commute with a, (see Problem 15(c) in §III.3). 

(d) Suppose nv is odd. Since T, commutes with a, and 7), it preserves each eigen- 
space in part (b) and each eigenspace in Problem 15(c) of §III.3. Show that the 
operator 7, on the two-dimensional space M,(T)(4)) is simply multiplication 
by o,(n). 

(e) Derive the following famous formula (see, for example, Chapter 20 of [Hardy 
and Wright 1960]) for the number of ways n can be written as a sum of four 
squares: 


n 


= ee (n) for n odd; 
240,(M9) form = 2'no even, 2} nN. 
3. If fe M,(N, x), show that f(0oo) = 0 implies 7}, f(oo) = 0, but that f(s) = 0 does not 


necessarily imply 7,, f(s) = 0 for other cusps s (give an example). 


4. (a) Show that if fe M,(N, x) and g(z) = f(Mz), then ge M,(MN, x’), where x’ is 
defined by x’(”) = x(n mod N) for ne(Z/MNZ)*. Let 7, be the Hecke operator 
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considered on M,(N, x), and let 7 be the Hecke operator considered on 
M,(MN, 7’). Suppose n is prime to M. Show that for fe M,(N, x) < M,(MN, x’) 
and g(z) = f(Mz)eM,(MN, 7’) we have: 


TIL = TS, Th g@) = (TS) (Mz). 


(b) Let f(z) = (n(z)n(2z))*, g(z) = f(2z) €Sg(T9(4)) (see Problem 18(d) of §III.3). 
Show that for odd n, 7, acts on the two-dimensional space S,(I9(4)) by multi- 
plication by a scalar. 

(c) Find the matrix of 7, acting on S,(I(4)) in the basis f, g; then find the basis of 
normalized eigenforms for all of the 7,. 


5. Let fi. ge M,(N, x) with f or g a cusp form. Let p|N. Note that T, = U,. Show that 
(Tf, > = P'S, V9). 


6. Let f, = (22) 7*A?, fy = (22) '*AE2, where (2z)7!7A = qII,(1 — q")** and E, = 
1 — 5042 6;(n)q". We know that S,,(I) is two-dimensional and is spanned by /; 
and f; (see §III.2). 

(a) Give a simple reason why /; is not an eigenform for the Hecke operators. 

(b) With the help of a calculator, find the matrix of the Hecke operator T, in the 
basis f,, fo. 

(c) Express in terms of /, and / the basis for S,4(I) consisting of normalized eigen- 
forms for all the 7,. 

(d) Express in terms of /, and f, the cusp form whose n-th g-expansion coefficient 
is the trace of 7, on S,,(I). 

(e) Find Tr 7; from part (d), and also directly by computing matrix entries. 


In this problem one encounters fairly large numbers; for example, the coefficients in 
part (c) are in @(,/144169). While Proposition 51 guarantees the existence of a basis of 
normalized eigenforms, it does not guarantee that it will always be easy to find it 
computationally. 


7. Let (L, t) be a modular point for [,(N). Let ce A"(N, {1}, Z), and let yeT,(N). For 
each « and », consider the lattice L’ = Lay. Show that L’ is a lattice which contains 
L with index n, that L’ depends only on the right coset of T,(N) in A"(N, {1}, Z) 
which contains ay, and that ¢ has order N in C/L’. Show that the L’ in (5.2) are in 
one-to-one correspondence with the right cosets of [,(N) in A"(N, {1}, Z). Use this 
to give another proof of Proposition 43 by comparing (5.9) and (5.27). 


8. Let fe M,(To(p)). 
(a) Show that yp = | and 7;=(?2 ~'), 0 <j <p, are right coset representatives for 
I modulo Ip(p). 
(b) Let Tr(/) be defined by: Tr(/') = Z?.. f|[y,],. Show that Tr( f) eM, (L). 
(c) If f happens to be in M,(I), then show that Tri f)=(p+1)f, and 
Tr|LG OI) =p! 7, fF 


CHAPTER IV 
Modular Forms of Half Integer Weight 


Let k be a positive odd integer, and let 2 = (kK — 1)/2. In this chapter we shall 
look at modular forms of weight k/2 = 4+ 1/2, which is not an integer 
but rather half way between two integers. Roughly speaking, such a modular 
form f should satisfy f((az + b)/(cz + d)) = (cz + d)**"7f(z) for (¢ 5) in 
I = SL,(Z) or some congruence subgroup I’ < I’. However, such a simple- 
minded functional equation leads to inconsistencies (see below), basically 
because of the possible choice of two branches for the square root. A subtler 
definition is needed in order to handle the square root properly. One must 
introduce a quadratic character, corresponding to some quadratic extension 
of Q@. Roughly speaking, because of this required ‘twist’? by a quadratic 
character, the resulting forms turn out to have interesting relationships to 
the arithmetic of quadratic fields (such as L-series and class numbers). 
Moreover, recall that the Hasse—Weil L-series for our family of elliptic 
curves E,: y? =x*—n?x in the congruent number problem involved 
“twists” by quadratic characters as n varies (see Chapter IJ). It turns out 
that the critical values L(E,, 1) for this family of L-series are closely related 
to certain modular forms of half-integral weight. 

One classical reason why modular forms were studied is their use in 
investigating the number of ways of representing an integer by a quadratic 
form: m = Lj )-, Aynjn, = [n]‘A[n], where A = [Aj] is a given symmetric 
matrix, [7] is a column vector and [n]' the corresponding row vector. For 
example, the number of ways m can be represented as a sum of k squares is 
equal to the coefficient a,, in the g-expansion of O*: 


In Chapter III we saw that for k even Oe M,,.(4, x“/7), where x_, is the 
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character modulo 4 defined by x_,(”) =) = (— 1)". More generally, 
for k even one can use ©, q"!'4™ to construct modular forms of weight k/2. 
The properties of modular forms are then useful in studying the number of 
representations m = [n]'A[n]. For example, in Problem 2 of §III.5 we used 
the action of the Hecke operators on M,(I9(4)) to derive a simple formula 
for the number of ways an integer can be written as a sum of four squares. 
For more information about the connection between quadratic forms in k 
variables and modular forms of weight k/2, see, for example, [Gunning 
1962] and [Ogg 1969]. 

It is natural to ask whether a similar theory exists for quadratic forms in 
an odd number of variables. This would be a theory of modular forms of 
weight k/2 where k is an odd integer. One would want ©* for k odd to be an 
example of such a form. Early investigators of representability of an integer 
as a sum of an odd number of squares—Eisenstein, and later G. H. Hardy— 
understood the desirability of such a theory of modular forms of half- 
integral weight. But it was only recently—starting with Shimura’s 1973 
Annals paper—that major advances have been made toward a systematic 
theory of such forms which rivals in elegance and beauty the much older 
theory of forms of integral weight. 

In this chapter we shall first present the basic definitions and elementary 
properties of forms of half-integral weight, largely following [Shimura 1973a, 
1973b], but with slightly different notation. We shall discuss examples in 
some detail. But when we come to the fundamental theorems of Shimura 
and Waldspurger, we shall not give the proofs, which go beyond the scope 
of an introductory text. Rather, we shall motivate those theorems using the 
examples. Finally, we shall conclude by returning to the congruent number 
problem and Tunnell’s theorem, which we used in Chapter I to motivate 
our study of elliptic curves. 


§1. Definitions and examples 


We shall always take the branch of the square root having argument in 
(—2/2, x/2]. Thus, ia is a holomorphic function on the complex plane with 
the negative real axis (— 00, 0] removed. It takes positive reals to positive 
reals, complex numbers in the upper half-plane to the first quadrant, and 
complex numbers in the lower half-plane to the fourth quadrant. For any 
integer k, we define z"? to mean (,/z)*. 

Whenever we have a transformation rule such as f(yz) = (cz + d)*f(z) 
(where y = (2 5), yz = (az + b)/(cz +. d)), we call the term (cz + d)* the 
“‘automorphy factor’’. It depends on y and on z. That is, an automorphy 
factor J(y, z) for anonzero function fhas the property that f(yz) = J(y, z)f(2), 
for zé H and y in some matrix group. Because 
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S(aBz) _ f(aBz)  f(Bz) 
SQ) f(Bz) f2)’ 


it follows that any automorphy factor J(y, z) must satisfy 
I(aB, z) = J(a, Bz) J(B, z). (1.1) 


For example, J(y, z) =cz+d for y= (¢ ")eT = SL,(Z) satisfies (1.1). If 
—1=(¢ °,)is in our matrix group, then J(y, z) must also satisfy: J(—1, z) 
= 1. Another example of an automorphy factor is J(y, z) =j(y, z) = 
(Sez! Vez + d,y = (4 5)eT)(4), which is the automorphy factor for @(z) = 
weg" g =e?" (see SIII.4). 

Suppose that, in defining modular forms of weight k/2 for k an odd 
integer, we try the most obvious thing: we look for functions / satisfying 
S(yz) = (cz + d)** f(z). However, J(y, z) = (cz + d)*? cannot be an auto- 
morphy factor of a nonzero function for any congruence subgroup I’ < 
and Any odd intone k. To see this, suppose T’ >T(N), N> 2. Let «= 

NN 1 ty) B = (ny 9). Then «, Bel’, and (1.1) would require the k-th power 
of the following satality of holomorphic functions on H: 


V(N? — 2N)z+1—N = /-Nzf(—Nz +1) +1—N-J—-Nz¥1. (1.2) 


Clearly, the square of (1.2) holds; thus (1.2) itself holds up to a sign. Since 
both expressions in the radicals on the right are in the lower half-plane, the 
right side is the product of two complex numbers in the fourth quadrant; 
but the left side is in the first quadrant, since (V? — 2N)z+1—NeH. 
Hence, (1.2) is off by a factor of —1, and (1.1), which is the k-th power of 
(1.2), is also off by — 1 if k is odd. Thus, we cannot have J(y, z) = (ez + d)*?. 

The natural way out of this difficulty is to force (1.1) to hold by simply 
defining the automorphy factor J(y, z) to be the k-th power of 


b 
IQ), 2) B O(y2)/O() = (5 i) cz+d for y 6 ‘eto (1.3) 


That is, for a congruence subgroup I’ < I',(4), one defines a modular form 
of weight 4/2 to be a holomorphic function on H which transforms like the 
k-th power of ©(z) under any fractional linear transformation in T’ (and which 
is holomorphic at the cusps, in a sense to be explained below). 

Recall that (4) is the Legendre symbol, defined for a positive odd prime 
d as the usual quadratic residue symbol, then for all positive odd d by 
UI platy. and finally for negative odd d as (jj) if c > 0 and —(q) if 
c <0. (Also, (-8-) = 1.) Further recall that we define ¢, = 1 if d= 1 (mod 4) 
and ¢, = iif d = —1 (mod 4). Thus, ¢7 = x_,(d) = (—1)¢"!? (note that this 
holds for negative as well as positive d). 

Thus, if we define f(z)|[y]y2 =J(. 2) “f(yz) for yeTo(4) and k odd, we 
shall require a modular form of weight k/2 for I’ to be fixed by [y],/2 for 
yel”. As in the case of integer weight, we shall want to define [a],,. for 
arbitrary matrices « € GL3 (Q) with rational entries and positive determinant. 
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But since the automorphy factor /(y, z) is defined only for ye T9(4), we have 
no preferred branch of the square root for arbitrary «€GL}(Q). This 
circumstance requires us to work with a bigger group than GL3(Q), a group 
G that contains two “copies” of each «€ GL3 (Q), one corresponding to each 
branch of the square root of cz + d. The example of the automorphy factor 
J(y, 2), which is a square root of (})(cz + d), shows that we also should 
allow square roots of —(cz + d). Thus, we shall actually define G to be a 
four-sheeted covering of GL3(Q). We now give the precise definition of the 
group G. 

Let T < C denote the group of fourth roots of unity: T= {+1, +i}. We 
now define G to be the set of all ordered pairs (a, @(z)), where « = (4% 5)e 
GL}(Q) and ¢(z) is a holomorphic function on H such that 


cz+d 


,/det a 


for some te T? = {+1}. That is, for each «€GL3(Q) and for each fixed 
t= +1, there are two possible elements (a, +(z))eG. We define the 
product of two elements of G as follows: 


(a, b(z))(B, W(Z)) = (a8, (Bz) (z)). (1.4) 


Remark. We could define G in the same way but with T defined to be the 
group of all roots of unity or even (as in [Shimura 1973a]) the group of all 
complex numbers of absolute value 1. However, for our purposes we only 
need fourth roots of unity. 


o(z) =t 


Proposition 1. G is a group under the operation (1.4). 


ProoF. We first check closure, i.e., that the right side of (1.4) belongs to G. 
Ifa=(¢ 2). B= (§ 4), we have 


(b(Bz)W(z))* = t,(cBz + d)ta(gz + h)/,/det « det B 
= t11,(c(ez +f) + d(gz + h))/,/det a 
= t,t,((ce + dg)z + cf + dh)/,/det af, 


which shows that (af, ¢(Bz)w(z))e€G. Associativity is immediate. It is also 
obvious that ((§ ?), 1)€G serves as the identity element. Finally, to find an 
inverse of (a, $(z)), where « = (% >), we write D = det a,’ = Da 1 =(4, 7), 
and look for (a~', W(z)) such that: d(a~!z)y(z) = 1. That is, we set w(z) = 
1/@(«~*z), and then must check that («7!, w(z))eG. But 


W(z)? = i/( (45%, + a) yD) = *VD(-ez + a)/(ad — bc) 
= : (-5 i. 5) vee at, 


which is of the required form. This completes the proof. oO 
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We have a homomorphism 
P:G>GL}(Q) 


which simply projects onto the first part of the pair: («, @(z)) «. The kernel 
of P is the set of all (1, #(z)) eG. Since $(z)* = t for (1, (z)) EG, it follows 
that ¢(z) must be a constant function equal to a fourth root of unity. Thus, 
if we map T to G by tr(1, 1), we have an exact sequence 


1>T+G>5GLi(Q)—1; 


in other words, the kernal of P is the image of T under ¢+>(1, 2). 

Similarly, if « = a(§ }) for aeQ, the inverse image of « under P is the set 
of all pairs (a, t) for te T, since in that case we again find that $(z)? must 
be +1. 

We let G' = P-\(L) be the set of all pairs (a, 6(z))€G such that «eT = 
SL,(Z). G' is clearly a subgroup. If € = (a, #(z))€G, we shall sometimes 
use the notation €z to mean the same as az for ze H. 

For ¢ = (a, $(z))€G and any integer k, we define an operator [€],,. on 
functions fon the upper half-plane H by the rule 


[Lue eS (az)o(z). (1.5) 


This gives an action of the group G on the space of such functions, i.e., we 


have (f@|LE]e2)| [E22 = fale Ca dij2> because of (1.4). 
Now let I’ be a subgroup of I)(4). Then /(y, z) is defined for elements 
yell’ (see (1.3)). We define 


Il’ = {0/0 Dyer}. (1.6) 


Clearly fF’ is a subgroup of G (because of (4.13) in §III.4) which is isomorphic 
to I’ under the projection P. Let L denote the map from I,(4) to G which 
takes y to 


F=(,/0, DEG. 
Then P and L are mutually inverse isomorphisms from I,(4) to I')(4). We 


call L a “lifting” or “‘section” of the projection P. In our notation we are 
using tildes to denote this lifting from I'y(4) to G: 


Li y 7 = (50, 2); 
P:(y, JQ, ZY. 
Suppose that /(z) is a meromorphic function on H which is invariant under 
[9]x2 for yel’, where I’ is a subgroup of finite index in I)(4). We now 
describe what it means for f(z) to be meromorphic, holomorphic, or vanish 


at a cusp seQ u {oo}. We first treat the cusp oo. Since I’ has finite index 
in I)(4) and hence in I, its intersection with 
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must be of the form {+(§ {)/} if —-lel’ and either {7b 4 or ((-(G DY} 
if —1¢I°’. (Wealways take h > 0.) Since [ — 1] yi is the identity, and j((§ *), z) 
= 1, in all cases f is invariant under z++z + A, and so has an expansion in 
powers of q, = e777". As in the case of integer weight, we say that / is 
meromorphic at oo if only finitely many negative powers of g, occur, we 
say that fis holomorphic at 00 if no negative power of q, occurs, and we define 
f() to be the constant term when fis holomorphic at oo. 

Now suppose that se Q@ U {00}. Let s= aco, aeT, and let on = (, (2) be 
any element of G' which projects to «: P(€) = 
and let G} = {ne€G*|noo = 00}. We have 


ef 3 


because I, = {+(6 /)}, and the $(z) for (§ {) must be a constant function f, 
as noted above. Now €~'I1é is contained in G' and fixes 0, i.e., €7 se ae 
G2. Moreover, P gives an isomorphism from €'Tjé to a "Tia < I,,. Since 
I’ has finite index in T, it follows that «'I’‘a is of one of the forms { + we ty}, 
{(4 "Ft, or {(—(§ 71))}. Thus, for some te T we have 


woe fe, 19 
+¢ = {(2( Teg ee 


Given s and €, h > 0 is determined by requiring (§ {) to bea generator of 
a 'T’a modulo +1; then ¢ is determined, since P is one-to-one on €~ is Wir 
i.e., we can find ¢ by applying the lifting map L to ta(§ fa tel. 


cil 


Proposition 2. The element ((j "), t)€ G3, depends only on the Y’-equivalence 
class of the cusp s. 


Proor. First suppose that € is replaced by another element ¢, = («,, ¢,(z))€ 
G' such that s = «,00. Then €'€,€G' fixes 00, and so it is of the form 
(+(5 1), f1). Then 


+E HS, = (EE) (LETTE 1G) 


(6 1G DYHG 7h) 


But ((3 1), ) commutes with ((} “), , so conjugating by €~*é, does not 
affect ((4 "), 2). 
Now suppose that s; = ys = (76) 00, where yeI’, ¥ = (7, j(y, z)) ET’. Note 
that I = yIyy"', and so ly = #17". Thus, 
+56) TS 56 = LEGIT GT DHE = AOE, 


and we again obtain the same ((§ “), £). This completes the proof. oO 
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We are now ready to define meromorphicity/holomorphicity/vanishing at 
the cusp s of I’. Suppose f is invariant under [Fw for eI’. Let s = E00, 
and set g = f|[€],,2. Then for any element +é-'ée +€'T’E we have 

| era =f|5 ee =f LA =J. 


That is, g is invariant under [((§ 1), 4) ]y2: 


I(( i): ‘| ao Me 


Write ¢* = e?*"", where r = 0, 4, 4, or 3. Then e 2*"*/"g(z) is invariant under 
zt+z + h, and so we obtain a Fourier series expansion e *”""*"g(z) = La,q7?, 
1.€., 


g(z) = g(2) 


g(z) = a e2tizintr)/h 
Da, 


We say that fis meromorphic at s if a, = 0 for all but finitely many n < 0, 
and that fis holomorphic at s if a, = 0 for all n < 0. If fis holomorphic at s, 
we define f(s) = lim,_;,. g(z). Automatically f(s) = 0 if r #0, since in that 
case the first term is aje?""" If r = 0, then f(s) = ap. It is easy to see that 
these definitions depend only on the I’-equivalence class of s, i.e., g = 
f\LElu2 may be replaced by g, =/|[Fé,]y2, where FET’ and €,00 = s (see 
the proof of Proposition 2, and also the proof of Proposition 16 in §III.3). 

It should be noted, however, that there is some indeterminacy in the value 
J(s) of a modular form at a cusp. Namely, if we replace eG" by ((4 9), £,)é, 
i i by ¢,*. If k/2 is a half- 
integer, this may alter the value by a power of i; if k/2 is an odd integer, then 
J(s) may be defined only up to +1 (compare with the proof of Proposition 
16); and if k/2 is an even integer, then f(s) is well defined in all cases. 

Given a cusp s of I’ and an integer k, we say that s is k-irregular if r £ 0, 
and we say that s is k-regular if r = 0, i.e., if f* = 1. Thus, if fis holomorphic 
at the cusps, it automatically vanishes at all k-irregular cusps. 

Note that whether a given cusp s of I’ is k-regular or k-irregular depends 
only on k modulo 4. That is, if ¢ = +i, then the cusp is k-irregular unless 
k/2 is an even integer; if t = — 1, then it is k-irregular unless k/2 is an integer; 
and if ¢ = 1, then it is always k-regular. In the case when k/2 is an odd integer, 
the terminology agrees with the definition of regular and irregular cusps in 
Problem 2 of §II.3. 


Definitions. Let k be any integer, and let I’ < I'9(4) be a subgroup of finite 
index. Let f(z) be a meromorphic function on the upper half-plane WH which 
is invariant under Lee for all }eI’. We say that f(z) is a modular function 
of weight k/2 for I” if f is meromorphic at every cusp of I’. We say that 
such an f(z) is a modular form and write fe M,)2(T’), if it is holomorphic 
on A and at every cusp. We say that a modular form fis a cusp form and 
write fe Syo(F’), if it vanishes at every cusp. 
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Now let N be a positive multiple of 4, so that (NV) < Py(4). Let x be 
a character of (Z/NZ)*. We let My/2(I'o(N), x) denote the subspace of 
M,2(1,(N)) consisting of f such that for y = (7 )eTy(N) 


f\y2 = xa) (1.7) 


We define S,o(I9(N), x) = Sa(T,(N)) 0 My2(Fo(N), x). The exact same 
argument as in the integer weight case shows that 


Mir (I, (N)) = @, Mi (T(N), 2: 


Furthermore, it follows immediately from the definitions that if y, and 
X2 are Dirichlet characters modulo N, then 


SE My(To(N), xi) (i= 1,2) implies fi fr. € Ma, +42 Fo), X1X2)- 
(1.8) 


Notice that for any ke Z, we have M,,. (Fo(N), x) = 0 if x is an odd char- 
acter, as we see by substituting —1 for y in (1.7): f[[(6 -2), D]w2 =S= 
x(—Df. For example, since the trivial character y= 1 is the only even 
character of (Z/4Z)*, this means that 


M,2(F,(4)) = Mya(To(4), 1) a Myj2(To(4)). 


If 4|N, recall that y_, denotes the character modulo N defined by x_1() = 
(—1)"? for ne(Z/NZ)*. 

Notice that in our definitions k is any integer, not necessarily odd. For k 
even, let us compare the above definitions of M,p2(T ‘> Seo( s My2(To(N ), 
D> Sya(Po(N), x) with the definitions of M,,.(I), S20), Myj(N, 2), 
Sy2(N, 7) in Chapter III. First, for any € = (a, O(z))EG, «= ( 4), o(z)? = 
t(cz + d)/,/det «, note that f(z)|[E]a2 = (2) “f(az) = 1 *?f(z)|[a]iy2, 80 
that [€],)2 for k/2€Z differs only by a root of unity from the operator [oJx)2 
defined in the last chapter. It immediately follows that for k/2¢Z the cusp 
condition defined in the last chapter is equivalent to the cusp condition 
defined above. 

There is a slight difference, however, between the condition that f be 
invariant under [7],,. for y = (4 ")eT’ and the condition that f be invariant 
under [}],)2. Namely, S|we = j(y, 2)"“f2) = (G(cz + a)“ fqz2)) = 
"°F |Ly]a2- Thus, if k/2 is odd, then [}],,2 differs from Ly] by x-1(d). 
From this discussion we conclude 


Proposition 3. Let 4|N, k/2eZ. Then 
My2(Fo(N), n= M2 (N, xin, Syo(To(N), DO = Sal, ten: 


It is now not hard to describe the structure of M,2(T, 0(4)). If we have 
a polynomial P(X,, ..., X,)€C[X,, ..., X,,] and assign “weights” w, to 
X;, then we say that a monomial I X/' has weight w = =n;w;,, and we say 
that a polynomial P has pure weight w if every monomial which occurs in 
P with nonzero coefficient has weight w. 
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Proposition 4. Let @(z)==22_,9", F(z)= Lrrgoaa M1 (M)Q", GQ = e7™, 
Assign weight 1/2 to © and weight 2 to F. Then My;.(1(4)) is the space of all 
polynomials in CLO, F'] having pure weight k/2. 


Proor. In Chapter II we found that for k/2eZ, the space M,.(N, x7) 
consists of polynomials in © and F having pure weight k/2 (see Problems 
17-18 of §III.3). This gives Proposition 4 when k/2eZ. Next, by the defini- 
tion of [F],j2, we have ©|[F],,. = © for yeT,(4). © is holomorphic at the 
cusps, because we checked holomorphicity of ©? at the cusps, and if © had 
a singularity at s, so would ©?. Thus, Oe M,p(To(4)). It now follows by 
(1.8) that any polynomial in © and F of pure weight /2 is in My2(Fo(4)). 
Conversely, suppose that fe M,/.(I'9(4)), where k is odd. Then f© is in 
Me+1y2(Po(4)), and so can be written in the form f O=aF“*""* + ©? P(O, F), 
where ae€C is a constant which equals 0 if 44 k + 1 and P(@, F) has pure 
weight (k — 1)/2. Then f— OP(O, F) = aF 14 /@ € My (To(4)). But be- 
cause © vanishes at the cusp —4, while F does not, this form satisfies the 
holomorphicity condition at —4 only if a=0. Thus, f= OP(O, F), and 
the proof is complete. Oo 


Corollary. Dim M,,. (T9(4)) = 1 + [k/4]. 


PROBLEMS 


1. Let y = (¢ 5)eT)(4), and let p = (3 9), m4). Check that peG. 
(a) Compute pjp™'. 
(b) If yeTo(4m), then y, = (" 2)y@ 9)! is in To(4). Compare pip~* with j,. Show 
that they are always the same if m is a perfect square, but that otherwise they 
differ by a sign for certain y€T)(4m). 


2. Let y =( *)eF)(4), and let p = (2 ~3), N"*,/z). Check that peG. 
(a) Compute pjp™’. 
(b) Suppose that 4|N and yeT)(N). Then y, =(X “aN 0)’ €To(N). Compare 
pip * with j,. 
3. Find A and ¢ for each cusp of T)(4). 


4. Let P’ <Ip(4) be a subgroup of finite index. Let k/2€Z. Show that if ™’ <1, (4), 
then My(T) = M,(T’). Otherwise, let y be the unique nontrivial character of 
V/V’ AT, (4); show that M,p(P) = Myo, 77) in the notation of §III.3 (see the 
discussion following Proposition 27). 


5. (a) Describe Sy2(To(4)), and find its dimension. 
(b) Show that for k > 5, the codimension of Swa(To(4)) in My(To(4)) is equal to 
the number of k-regular cusps. 
(c) Find an element 2a,g" in S, 321 0(4)) such that a, = 1 and a, = 0 (there is only 
one). 


6. Let 4|N, and define yy by xy(d) = (3) if g.c.d.(N, d) = 1, xv(d) = 0 ifg.c.d.(N, d)>1. 
(a) Show that yy is a Dirichlet character modulo N. 
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(b) Let p be as in Problem 2 above, let y be an arbitrary Dirichlet character modulo 
N, and suppose that fe My.(I'9(N), 4). Show that f|[p]y2¢My2(To(N), Xx). 


7. Find the value of Oc M,. (T',(4)) at the three cusps of I'p(4). 
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Recall the functions G,(z) and £,(z) in M,(T) that we defined in §III.2 for 
k > 4 an even integer. We set G,(z) = Z(mz +n)", where the sum is over 
m,neZ not both zero; and then E,(z) was obtained by dividing by G,(i00) = 
2¢(k). Alternately, we can obtain E,(z) by the same type of summation 
X(mz + n)* if we sum only over pairs m, n which have no common factor 
(see Problem 1 of §III.2): 


1 


E,(z) = i rears 


(2.1) 
where the sum is over m, ne Z with g.c.d.(m, n) = 1 and only one pair taken 
from (m, n), (—m, —n). In §HI.2 we obtained the g-expansion 


2k ~ n niz 
E@)=1-F Loata", ga er. 
kn=1 


We now want to give an analogous construction, with k replaced by a 
half-integer k/2 and I replaced by I'p(4). To do this, we give a more intrinsic 
description of the sum (2.1). Notice that if we complete the row (m, n) to 
form a matrix y = (4, °)eI, which can be done because g.c.d.(m, n) = 1, 
then the summand in (2.1) is the reciprocal of the automorphy factor J,(y, z) 
for functions in M,(I). That is, such functions f satisfy: f(yz) = J,(y, z) f(z) 
for J,(y, Z) = (mz +n)‘. In (2.1) we are summing 1/J,(y, z) over all equiva- 
lence classes of yeI’, where y, ~ y, if y, and y, have the same bottom row 
up to a sign, i.e., if yy = +(6 /)y, for some +(3 /)eT,,. In other words, 
we may regard J,(y, z) as a function on the set T’,,\V of right cosets, and then 
rewrite (2.1) as follows: 


E@= > Ay, z)*, k= 4even. (2.2) 
yeTg\t 


It is now possible to give a proof of the invariance of E,(z) under [y, ], 
for y, €T which easily generalizes to other such sums of automorphy factors. 
Namely, note that, by the definition of [y, ],, we have 


E,(z)|[y1 Je = 41; z)*E, (12) 
= T15 zy Y Cy, 12) 


yelQ\r 


= Y 401.2), 


yeT\P 


because of the general relation (1.1) satisfied by any automorphy factor. 
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But right multiplication by y, merely rearranges the right cosets I, y. Thus, 
the last sum is just a rearrangement of the sum in (2.2) for E,. Since the sum 
is absolutely convergent for k > 2, we conclude that £,|[y1], = Ex. 

We now mimic this procedure for half integral weight 4/2 and the group 
r,(4). In order to have absolute convergence, we must assume k/2 > 2, i.e., 
k>5S. 

Note that we obviously have T,(4),, =T,, = {+ })}. 


Definition. Let k be an odd integer, k > 5. 
EpZ@= Y jo,2*. (2.3) 


vel g\lo(4) 
As representatives y of I-,,\[9(4) we take one matrix (%, ?)€T'y(4) for each 
(m, n) with 4|m, g.c.d.(m, ) = 1 (in particular, m is odd), and we may specify 
n > 0so that we get only one of the pairs +(m, n). Thus 


a b v# 
Ey2(Z) = oy J »Zy . 
m on 
m,neZ,4|m,n>0,g.c.d.(m,n)=1 


Substituting the definition of j(y, z) and noting that (7)"* = (8), since k is 
odd, we obtain 


Ey@= >. (2) ek(mz + n)*?, k= 5 odd. (2.4) 
4|m,n>0 n 
g.c.d.(m,n)=1 


The fact that Fy). € My). (F,(4)) now follows by the exact same argument 
as in the case of the Eisenstein series of integral weight for [T which we 
considered above. Namely, Eyj>(z)|[F1Jk2 =J01, 2) “Ex2(712), which just 
gives a rearrangement of the sum (2.3). Finally, the cusp condition is routine 
to check, so we shall leave that as an exercise (see below). 

Unlike the case of T, where there is only one Eisenstein series of each 
weight, E,/.(z) has a companion Eisenstein series /,,2(z), defined by 


(C=) 


That there are two basic Eisenstein series for each half-integer 4/2 is related 
to the fact that [,(4) has two regular cusps (see Problem 3 of §1 and the 
discussion at the very end of §IJI.5). To see that Fj. € My) (f'o(4)), we apply 
Problem 6 of §1 in the case N = 4, y= 1. 

To write F,,. more explicitly, we compute 


Fiy2(Z) = (22) “? Exja(— 1/42) 


Fu = Exj2 


oo (4z)*? 4|m,n>0 n "(—m/4z + ni? 


g.c.d.(m,nj=1 


m 1 
= Pee o (Jet ae 4nz)F?’ 
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where we have used the fact that »>0 to write a 4z,./—m/4z +n= 
.f —m + 4nz. We now replace m by — 4m, so that the sum becomes a summa- 
tion over all meZ with g.c.d.(4m, n) = 1, i.e., n odd and g.c.d.(m, n) = 1. 
We obtain 


_ 94-k/2 —M\ ok 1 
Fyj2(Z) =2 i ag ( n yes (nz ne m)ki2* (2.5) 
n>Oodd 


g.c.d.(m,n)=1 


We next compute the q-expansions of F,,. and F,,.. We proceed much as 
in the computation of the g-expansion of E, in §II.2, except that there are 
added complications. 

The derivation of the q-expansion for F,,.(z) turns out to be slightly less 
tedious than for E,/.(z). Hence, we shall give that derivation, and omit the 
analogous proof of the q-expansion formula for E,,.(z). 

To compute the g-expansion of F,,.(z), we start with the following relation, 
which was proved in Problem 8(c) of §II.4 for any a > 1, ze H (where we 
have replaced a, s by z, a here): 


YY C+ = Cape tr @ ¥. ete, (2.6) 
h=— 0 1=1 
(Note: for ze H we define z* = e*'°**, where log z denotes the branch having 
imaginary part in (0, 2) for ze H.) 
Since the sum (2.5) for /,2(z) is absolutely convergent, we may order the 


terms as we please. If we let j= 0, 1, ..., 7— 1 and write m= —j + nh, 
heZ, we obtain 
Fiy2(Z) 
SS ee (2) Y (az — jf + nh)y*? 
n>Oodd 0<j<n\" J p=—a@ 
- -1 0 
_ 9~-k/2 k—k/2 Jj k/2_,—nik/4 k (k/2)—1 ,—2nilj[n ,2nilz 
=? ye az ({)en e r(§) ome e ernie. 
n> <j<n 1=1 


by (2.6) with z replaced by z — j/n and a replaced by k/2. 
Thus, F,2(z) = 22, bq', q = e?*", where 


pe sl [(k/2)-1 Y ekn "2 Y Gaae (2 7) 
rer" jeoeaa.” oxj<n\t , , 

We now simplify the expression (2.7) in the important special case when 
/ is squarefree. 

Let meZ be squarefree. For j > 0 odd we let y,,(j) denote ). Thus, 
if j is a prime, it is y,,(j) which tells us whether j splits, remains prime, 
or ramifies in Q(,/m) (i.e., this depends on whether An(J) equals 1, —1, 
or 0, respectively). There is a unique character, also denoted y,,, which has 
conductor |m| if m = 1 mod 4 and conductor |4m| if m = 2 or 3 mod 4 and 
which agrees with y,,(j) = (#) for j > 0 odd. We can express y,, in terms of 
the usual Legendre symbol as follows: 
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m= (5) ifm = 1 mod 4: 


Im) = i) a) ifm = —1 mod 4; 
i J \\m| 


Am) = a (5 md ifm = 2mo = 2mod 4, 


where we define (|!) = (—1)4~!” for j odd and (;') = 0 for j even, and 
(2) = (- Ns an jodd and (2) = 0 for j even. 
Note that 


%r—-l)=1 if m>O, and y,(-l)=—-1 if m<0. (2.8) 
Recall that we define 2 = (k — 1)/2. 


Proposition 5. The Eisenstein series Fyj, has q-expansion coefficient b, which 
for | squarefree is equal to L(y,-1)4, 1 — A) times a factor that depends only 
on 2 and on 1 mod 8, but not on 1 itself. This factor is given in (2.16) below. 


We first prove a lemma. 


Lemma. Let n = ngni, where ng is squarefree. Then for | squarefree, the inner 
sum in (2.7) vanishes unless n,|l, and if n,|/ it equals 


by (Gr)momen ns. 2.9) 
10) 


where is the Mobius function (equal to 0 ifn, is not squarefree and equal to 
(—1)' ifn, is the product of r distinct primes). 


PRoor oF LEMMA. First, if 2, = 1, then (2) has conductor n = ny. If /has a 
common factor d with n, then, combining terms which differ by multiples of 
n/d, we see that the inner sum in (2.7) is zero. On the other hand, if/e(Z/nZ)*, 
then making the change of variables j’ = —/j leads to (G)Z (f)e?™/"". This 
Gauss sum is well known to equal Ena/n (see, e.g., §5.2 and §5.4 of [Borevich 
and Shafarevich 1966]). This proves the lemma when n, = 1. 

We next show that the inner sum is zero if g.c.d.(m, m,) =d> 1. Note 


that 
(2) o (4) feed(pay ay 
‘ 0 otherwise; 
and it follows that (4) = (). Combining terms with j which differ by 


multiples of n/d? and using ‘the fact that d?// (since / is squarefree), so that 
eee nan — 0 we find that the inner sum is zero in this case. 
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Now suppose that 1, > 1 is prime to mo. Let p, run through all primes 
dividing n,: n, = I1p%. Set g, = p%; and let 6,(/) = 0 if p,|j and 6,(j) = 1 
if p,} j. Then (4) = G4), 6,U). Any je Z/nZ can be written uniquely in 
the form 

j=joni+ Ding, 0<jo<mo, 05i,< 4. 
Then 


en 2niliin  g~2niliolno Il en 2niliyiay 


v 


(:) = (x) 118i) = (22) avo. 


Thus, the inner sum in (2.7) becomes 


Jo 0 v iy 


Further note that 


The first sum is by, (a)V/No , by the first part of the proof (the case n = No). 
The sum inside the product is 

qn qlPy-1 

y en 2nilivias — en 2nilpyiylay 

dy=0 jy=0 
Here the first sum is zero, because q? f I. The second sum is also zero, unless 
q2\lp,; this can happen only if g, = p, and p,|/. Thus, the expression (2.10) 
is zero unless each q, = p,, i.e., n, is squarefree, and n,|/. If n,|/, then each 
sum inside the product in (2.10) is equal to —p,, and we obtain the following 
expression for the desired sum: 


by (5. wie Tp. 
0 v 


(Note that ¢,, = ¢,.) The latter product is x(n,)n,. This completes the proof 
of the lemma. Oo 


We are now ready to compute b, for / squarefree. 
We replace the inner sum in (2.7) by its value given in the lemma, and sum 
over all n with a fixed n). According to the lemma, we obtain 


ae (noni? ( mannan: 


- -! 
1 ky pnik/4 <a 
TG)e no>Oodd ny |! No 

and squarefree n, odd 


Since &,,2 = &,, the expression inside the summation over Mp is equal to 


—l 
k+l 1-K/2 1-k 
Eno (=m si x H(n,)ny*. 


oddn,|I 


qehl2pki2-1 


The sum over 7,|/ is equal to Tyga p:(1 — p’~*); since it does not depend on 
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No, we can pull it outside the summation over ny. In addition, we use the 
relation (recall: 4 = (k — 1)/2) 


“(=e -G'() 


As a result, we obtain 


b qe kl2 pki2-1 1 eT _ 1 a / ay ) 7 
1= P@ emt II ¢ pe) ee (=) (i) . (2.11) 
podd and squarefree 

The expression (%3)“(a5) is what we denoted y,_;,4,(9). As remarked above, 
these are the values at odd squarefree positive of a primitive character 
X-141 Of conductor N =/ if (—1)‘/=1 mod 4 and conductor N = 4/ if 
(—1)*/ = 2 or 3 mod 4. 

Thus, the sum in (2.11) looks very much like the value at s =A of the 
Dirichlet L-series 


Lay 9S DY H-yalnyn’, Res> 1. (2.12) 
n=1 


The difference between the two sums is that the sum in (2.11) only ranges 
over all odd squarefree n. But if we write an arbitrary n in the form n = nn? 
with y squarefree, we see that ¥,_1)4,(”) = %-1)4,(Mo) if n, is prime to N, and 
X-14(n) = 0 if g.c.d.(n, N) > 1, where N =/ or 4/ is the conductor. 
First suppose that N = 4/. Then, if we were to multiply the sum in (2.11) 
by 
~ @p)*=[ld-p 1 =¢2aT] 0 —p, 
n,>OprimetoN pin PIN 
we would obtain the L-series (2.12) with s = 4. In other words, we have 
. a gp hl2k/2-1 aah 
bC(24) [] = po) = eg Ly-ayr A T] A — 2). 
p|N Maje pit 
pF2 
The products cancel, except for p = 2|N on the left, so we obtain (here we 
replace k by 24 + 1): 
; = LAG as A) [4-1/2 mate 
! €(2A) ic 9724 inl 4 dyeria/2 +4) 
for (—1)4/=2 or 3 mod 4. 
If, on the other hand, N=/, i.e., (—1)*/=1 mod 4, then to obtain 
L(q-1)41 4) we must multiply the sum in (2.11) by 
1 1 
A hapdno Ll L=p 
where the first term (1 — y,_,,4,(2)2~ 7)’ is necessary in order to get the terms 
in (2.12) with n even. Since 1 — 2774 = (1 + %~1)4(2)2- (= y-ya,(2)27), 


(2.13) 


—2,? 
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we can rewrite the last product as 
(i+ X—1)4(2)2-4) II Oe ae Te” 
pti 
This leads to the equality 


— Lay, A) pr-te qt 
1 C24) 1+ -ya(Q272 TO + em 


for (—1)4/=1 mod 4. 


(2.14) 


The formulas (2.13) and (2.14) express the /-th g-expansion coefficient of 
Fj. in terms of the values at positive integers of an L-function and the 
Riemann zeta-function. Although these formulas include complicated- 
looking factors, these factors largely disappear if we use the functional 
equations for L(y,_1)4,, 8) and ¢(s) to express b, in terms of values at the 
negative integers. 

Namely, by the theorem in §II.4 we have 


C(2A) = CL — 2a)? (5 — A/T (A). 


Next, Problems 3(e) and 5(e) of §II.4 give us the functional equation for 
L(%-1)41. 5), where the functional equation is slightly different for even and 
odd characters. By (2.8) it follows that y_,)4, is an even character if A is 
even and an odd character if A is odd. We thus have 


Tl 


a-(1/2) 

Ly-1)4, A= (5) L(%-1)4 1 — A) 

(2.15) 
4 — 4a/TGA) if 2 is even; 
Td —3A/lG +44) if is odd. 

Substituting these equalities in (2.14) and using the relations ['(A) = 

TGAVG + 3A)27"1/,/m (see (4.4) of $11.4) and P(x)P — x) = 2/sin xx 

(see (4.3) of §11.4), one obtains (the details will be left to the reader): 


24-12 ¥ 
2 it(-1) = 1 mod 4; 
b= Lyi, 1-4). 1+ x-1)41(2)2 ‘ (-) 
T+ (—I'C — 2) 212-2 
Sa dd lea an if (— 1) = 2 or 3 mod 4. 


(2.16) 


Notice that the L-function value depends on the precise value of /, but all 
of the other factors are either independent of / or else only depend on the 
value of / modulo 8. This completes the proof of Proposition 5. a 


For fixed / even,as / ranges through squarefree positive integers, the even 
characters y,_,,4) run through the characters of all real quadratic fields 
Q(.//). The discriminant D of Q(//) is D=/ if /=1 mod 4 and D=4/ 
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if / = 2 or 3 mod 4. For fixed A odd, the odd characters y,_,)4, run through 
the characters corresponding to all imaginary quadratic fields Qav/—-), 
whose discriminant is D = —/if —/=1mod4and D= —4/if —/=2 or3 
mod 4. 

Thus, for (—1)/=1 mod 4, the formula (2.16) expresses the |D|-th 
g-expansion coefficient of F,,,,. in terms of the value at 1 — 4 of the L- 
function of the quadratic character of conductor | D]. 

By a similar computation, it is not hard to show that for (— 1)*/ = 2 or 3 
mod 4, |D| = 4/, the | D|-th g-expansion coefficient is given by 


: Lyin LA) 24-12 
a oa a ae (+ (1c =D): Leet 


We now look at the g-expansion coefficients of the other Eisenstein series 
E4412. By an argument similar to the derivation of (2.16)—(2.17), one shows 
that for / squarefree the /-th coefficient a, in the q-expansion of F,,. = X aq' 
is given by 


(2.17) 


1+ %-ral2)2!-* if (—1)47 = 1 mod 4; 
—1 if (—1)4/ = 2 or 3 mod 4 
(2.18) 


a=(1+(—- pinata} 


1 : ace Ay—1ya(2)2*~* 
_ L141 — 4), 21+ Xe—1yau(2)2% 
C(1 — 2A) 1 
1 ~~ 24 


if (—1)*/ = 1 mod 4; 


if (—1)*/ = 2 or 3 mod 4; 
(2.19) 


and that when (— 1)*/ = 2 or 3 mod 4, so that D = (— 1)“4/is the discriminant 
of a quadratic field, we have 
Leu, La) = gan 
(i= 22h 
(2.20) 


day = (1 + (— 14) 2-0 Ad = 29g = 


Thus, we have found that both £,,. and Fy. have |D|-th g-expansion 
coefficient equal to L(zp, | — 2) times a factor which depends only on 
and, in the case D = (— 1)*/ = 1 mod 4, on xp(2), which equals 1 if (— I= 1 
mod 8 and — 1 if (— 1)*/ = 5 mod 8. For now, let us abbreviate y = xp(2). 

The same is true of any linear combination of E,,. and Fj, so it should 
be possible to find a linear combination whose | D|-th q-expansion coefficient 
is exactly L(zp, 1 — 4). More precisely, we look for coefficients « and B 
such that 


CU — 2A) (WE gy. + BFiy2) 


has |D|-th g-expansion coefficient c,p, equal to L(xp, 1 — A) for all discrim- 
inants D of real quadratic fields if 2 is even and of imaginary quadratic 
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fields if 2 is odd. The term ¢(1 — 2A) is inserted for simplicity, in order to 
cancel the term ¢(1 — 24) in the denominators of all g-expansion coefficients 
of Ey). and Fj. Thus, we want: 


for D=(-—1)*4/=1 mod 4, 

LQ, 1-A= Cip| 
= 1 142! "4y 1 oSoe ay 
= Lp, 1 — a(5 1+277y Bey 1+(-—1)4i 1+274y }’ 


for D=(—1)*4l,  (—1)47=2o0r3 mod 4, 


L(xyp, 1 — A= Cip| 
1 1 — 217-24 1 94-C/2) ) 


= Ep, I a3 [a2 ieee (=o 
Dividing by L(yp, 1 — 2), we obtain a 2 x 2 system of equations in the 
unknowns «, f. Because y depends on / modulo 8, it is not a priori clear 
that the « and f for which aayp, + Bbip, = L(xp, 1 — A)/ECL — 24) will be 
independent of /. However, when we solve the equations for « and B, we 
find that the solution does not actually contain x, and so is independent of /. 
Namely, we obtain 


a=1, p=(14+(—-14%27-0, 


We conclude 


Proposition 6. Let 2 = (k — 1)/2 > 2, and let Ex), Fz € Myj2(To(4)) be defined 
by (2.4)-(2.5). Then 


Ayo GEO — 24) (Enya + (+ 22-67 Fiya) € Muyo (Fo(4)) 


has the following property: if D = (—1)*l or (—1)*4l, 1 > 0, is the discriminant 
of a quadratic field and xp is the corresponding character, then the \D\-th 
g-expansion coefficient of Hyj. is equal to L(xp, 1 — A). 


This proposition is due to H. Cohen [1975]. (His notation is slightly 
different from ours.) It can be viewed as a prototype for the theorem of 
Waldspurger—Tunnell which we shall discuss later. The Waldspurger— 
Tunnell theorem exhibits a modular form of weight 3 (think of 4 as being 
equal to | in Proposition 6) for (128) such that the square of its n-th 
q-expansion coefficient is a certain nonzero factor times L(E,, 1), where 
L(E, s) is the Hasse—Weil L-function of an elliptic curve E (see §II.5) and 
E, is the elliptic curve y? = x? — n?x. Thus, the g-expansion coefficients of 
this particular form of half integral weight are closely related to all of the 
critical values L(E,,, 1) which we encountered in Chapter II. 

If we were allowed to take 4 =1 in Proposition 6, then the |D|-th q- 
expansion coefficient would be L(7p, 0), which differs by a simple nonzero 
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factor from L(zp, |) (because of the functional equation), and is essentially 
equal to the class number of the quadratic imaginary field Q(,/D) (here 
D = —l/or —4/). We will say more about the case 2 = 1 below. 

For another proof of Proposition 6, using so-called “Jacobi forms,” see 
[Eichler and Zagier 1984]. 

We now discuss some interesting consequences of Proposition 6. Since 
any element Ay € Myj2(T9(4)) can be expressed as a polynomial in 0 = 
re _q” and F= 2,954.9, (n)q" (see Proposition 4), it follows that there 
are formulas expressing L(yp, 1 — 4) in terms of o,(m) and the number of 
ways n can be expressed as a sum of r squares. We illustrate with the simple 
case k = 5, 2 = 2. In this case Ms,(I'o(4)) is spanned by ©* and OF, and a 
comparison of the constant coefficients and the coefficients of the first 
power of q shows that Hs. = 7350° — OF (see the exercises below). This 
yields the following proposition. 


Proposition 7. Let s,(n) denote the number of ways n can be written as a sum 
of r squares ; thus ©° = Xs5(n)q". Let D be the discriminant of a real quadratic 
field, and let xp be the corresponding character. Then 


1 


Ltn = 7558s) -g LD —?). 


6 I< 
D—j? odd 

Many other relations of this sort can be derived, in much the same way 
as we used the structure of M/,(T) in the exercises in §III.2 to derive identities 
between various a,(7). For details, see [Cohen 1975]. These relations can be 
viewed as a generalization to A > | of the so-called “‘class number relations” 
of Kronecker and Hurwitz. (For a statement and proof of the class number 
relations, see, for example, Zagier’s appendix to [Lang 1976] and his correc- 
tion following the article [Zagier 1977].) The class number relations corre- 
spond to the case A = 1, 1.e., L(yp, 1 — 2) = L(y, 0). But 2 = 1 falls outside 
the range of applicability of Proposition 6. 

To get at the class number relations from the point of view of forms of 
half integral weight, one must study a more complicated function H3,., which 
transforms under I,(4) like a form of weight 3 but which is not analytic. 
The |D|-th coefficient of H,, is essentially the class number of the imaginary 
quadratic field Q(,/D). The study of H3)) is due to Zagier [1975a]. 

In some sense, the situation with H3,, is analogous to the situation with 
E, in $111.2. In both cases, when we lower the weight just below the minimum 
weight necessary for absolute convergence, problems arise and we no longer 
have a true modular form. The study of EF, and 43). is much subtler than 
that of £, and H,). for k > 2 and k/2 > 2, respectively. But in both cases 
the Eisenstein series, though they fail to be modular forms, have a special 
importance: we saw that the functional equation for EF, led to the functional 
equation for the discriminant form A(z)eS,,(I); and H%3,. has as its coeffi- 
cients the class numbers of all imaginary quadratic fields. 
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We next examine the quite different question of how the nonsquarefree 
q-expansion coefficients of E,,., Aj2, and Hy. can be determined. Again 
we shall give the details only for F,,. and shall omit the similar computations 
for E,)>. 

We proceed one prime at a time. That is, for each prime p we express 6, 
in terms of b,, where / = p?*/, and p*{/). Finally, combining the different 
primes p, we shall be able to express }, in terms of b, when / = /, l? and I, 
is squarefree. This information can be conveniently summarized in the form 
of an Euler product for 27°; by, pl, *, as we shall see later. 

The easiest case is p = 2. Suppose that /= 4"/,, 4) /). We return to our 
earlier computation of }, in (2.7). The general formula (2.7) is valid for all /, 
not necessarily squarefree. If we replace / by 4/ in (2.7), the double sum 
does not change, because replacing / by 4/ is equivalent to replacing / in the 
inner sum by 4j, which runs through Z/nZ as j does, since n is odd. Thus, 
by, = 4*?-1b,, and so for / = 4°/, we have 


b= See (2.21) 
This case p = 2 can be summarized as follows: for any /o, 
> ~sy 1 
Y by av2 = bers (2.22) 
v=0 ~~ 


Comparing coefficients of 2~*” on both sides of (2.22) shows that (2.22) is 
equivalent to (2.21). (Note that we do not actually need to assume that 4//.) 

Now suppose that p is odd. Let / = p?"/y, where p? {/). We again use (2.7) 
and divide into two cases. 


Case (i) p|/o. Let fg = plo. 


We write the double sum in (2.7) in the form 2,..9.4q5,, where S, is the 
term in (2.7) corresponding to n. Let n= nop", where p* no. We fix no 
and look at the sum of the S, over all n = n)p?",h = 0, 1, 2, .... Note that 
é, = €,,and n*? = ng*?p~"*, We now evaluate the inner sum over je Z/nZ. 

First SUPPOSE that ee As representatives j of Z mod nZ we choose 
J =Jop?" +j,No as jg ranges from 0 to ny — 1 and j, ranges from 0 to p?" — 1. 
With p fixed we introduce the notation 


Ae (0° Sh ply; 
wn={" if pi. (2.23) 


Then for / > 1 we have 


(2) = Co) (x)= 94( 7) = 24>(m) 


1+2(v-h) 


Also, 


en 2Riiln e7 2tliolMog— 2nij lop 


Thus, the term S, in (2.7) corresponding to n = ny p~" is the product of 
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ane & (Blemimas, 
O<ig<ny \“o 
and 
= »\ —2nij.T pi t2-A) 
P hk Y b(j,)e 2nijlop , 
OK<i, <p2h 
In the last sum, make a change of variables by replacing —/j,/) by /, ; then 
the sum becomes 
y o2nii;p! +2(v—h) y p2niip2*20-m) 
O<j,<p2h O<j,<p2h-1 
(where j, = pj2). The first sum is zero if h > v and it is p*" if h < v; the second 
sum is zero ifh > v + land it isp" ifh<v+1. 
We conclude that all of the terms in the outer sum in (2.7) corresponding 
to n = ny p”" for fixed ny contribute 


Sho ( + y pa — p1) — pritanpaoen) 


h=1 
= $1 — pty Y pt’, 
: h=0 


We next suppose that p|no, No = prio. As representatives j of Z mod nZ 
we choose 


T= ep. +jiNo, 0 <jg < No, O<s, ape 


We have 
Cr) ()las) =) Gi) 
n P} \fo Pp} \ Mo 
Then S, is the product of 
ekyn #2 y (422) 7 2tligllig 
0<jo<iy \ Mo 
and 


O<i, <p2htl Pp 


But it is easy to see that for any h the latter sum is zero. Namely, if h < v, 
then we obtain 2;, (4) = 0; and if A> v, then the sum over any given 
residue class mod p, i.e., j, =j2 + p/3 for fixed j,, is equal to 


(2 en 2ttigto p20) y on 2miishop2-W) +1 
P 


and the latter sum is zero, because 2(v — h) + 1 < 0 and p/ i). Thus, S, = 0 
if n is divisible by an odd power of p. 


i 
O<j,<p2" 
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We conclude that for p|/) we have 


i= qhl2 [(H/2)-1 Y S,,(1 — pi-* y pr2-®, 
T (5) enikl4 Ng> 0 odd, ptng h=0 
2 


Here S,, depends only on /, and not on / = /, p?”, because for p} ny 


5 “lV j 
y (Z) en 2tiliing — y (2 jee = y (Z) ea 2Rily jing 
O<j<ny \“o, O<j<ny \ “o O<j<ny \Mo 


Thus, 
b/d, = (Ufly) 7! x pra-® = > prik-2, 


This relation between b, and 5, can be summarized in the following 
identity, which is valid for any /, with P\lo, P? Ho: 


= a 1 
b, ,2vp * =b -, 2.24 
ys ly p2 P Aral —p (1 — pr 2-5 ( ) 


To- see the equivalence of this identity to the formula derived above for 
bj/b,,, it suffices to expand (1 — p™*)"? and (1 — p*-?~*)" in geometric 
series and in that way conclude that the coefficient of p~’’ on the right in 
(2.24) is 


by, > prk-2), 


Case (ii) p{/p. Again we fix ny with p? no and consider all terms S, in 
the outer sum in (2.7) for which n = ny p”",h=0,1,2,.... 


First suppose that p/n . We take j = jy p*" + j;n9 as before, define 5(/) 
by (2.23), and find that 


S,=S.P 7 Y SUjeW2mitor20™ 


O<j,<p2h 
As before, this sum can be rewritten in the form 


oe 2s, p20) x eo 2p T7207) 
O<j,<p2h O<j,<p2h-1 
which equals zero if h > v and p”" — p?""! if h < v. Thus, 
rs nop2h = Sno (: + py po*(p* — py), (2.25) 


Now suppose that Pos No = phi. At this point we change our notation, 
replacing fig by no. That is, we shall determine £, S, prt, where now Ag is 
not divisible by p. As before, we set j =j)p?"*! + j,m9, and find that S,, 
where n = ny p”"*?, is equal to the product of 
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ela ee > (422) e7 27Holino (2.26) 
O<jo<no 0 
and 
pe (2) 2 (4) en 2rtislop2 OE (2.27) 
PY o<j,<p2ht1 P 


The latter sum is easily seen to vanish if either  < v or h > v. So the only 
nonzero S, occurs when h = vy, i.e., n= p?’*'ng. Then the sum of (2.27) 
overh=0, 1,2, ..., vis equal to 


p(=!)(2) x (Z ler 
P PY ycjep2vt1 \P 


(after the change of variables j = —/)j;); and this sum is p*” times the 
Gauss sum €,,/p. Meanwhile, (2.26) is equal to 


P¥(ConalEng)* (£) Ss, 
No 


2v+l We have 


Hence, forn = nop 
2 —ve{ —é 2 n 
py Syop2htt = P?’eps/P P ‘i (=") Dp ud (ee (Z) (") Sho: 
We check that ¢,(&pn,/én,)* (4s) (p) = (G')**? by considering the four cases 
DP, > = +1 mod 4. Hence, 


yw(2—-k)-A 


>. Sipp2htl =p Xi-1)41,(P) Sn, - 
h=0 


Thus, combining this with (2.25), we find that the total contribution to 
the outer sum in (2.7) of all terms corresponding to n of the form ng p”" or 
No p?"*} (for fixed ng not divisible by p) is equal to 


Sig ( Spree ya (iy Se poe =p), (2.28) 
h=1 


In the case / = /y, v = 0, this contribution is simply 
SoC + po*Y(—1)41,(P))- 


Let us temporarily abbreviate ¥ = %-1)4:,(p). Then we compute that 
b py? ( + preW-ay + » p"*(p" — pr ) Y Siro 
1 h=1 


ny> 0 odd, ptng 
by, (EPR). des Sa 


* a Ny> 0 odd, pt ng 

x per = » pe +p *y 

h=0 h=0 : 
+p x 
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By a simple algebraic computation, using x” = 1, we find that this last ratio 
is equal to 


v v-1 
b,/by, = os pik?) at yp} x pre). (2.29) 
h=0 h=0 
The relations (2.29) for v= 0, 1, ..., can be expressed by the identity 


A-1-s 


0 1 = X-1)41,(P)P 
¥: bpp =b Kes (2.30) 
v=0 


Step 30S pee 
as we see in the usual manner by gathering coefficients of p *” on the right. 
Note that the relation (2.24) that we derived in case (i) is the same as (2.30), 
because %-1)4,,(p) = 0 when p\lo. Thus, in all cases we have (2.30). 

The next proposition combines the identities (2.30) for all p odd and 
(2.22) for p = 2. 


Proposition 8. Let /, be a squarefree positive integer, let b, denote the q- 
expansion coefficient of F,j., and set 2 = (k — 1)/2. Then 


1 Y-1)41,(P)P 
5 aly 7 =e ps - =a 
2 ral a 5 mts a ~ yd — pr : ) 


a-1-s 


(2.31) 


Proor. Let /, = p;'---p,". Let J, (Ss) denote the factor in the product corre- 
sponding to p (here f,(s) = (1 — 2*-?"*)"'). Then we must show that b,,.2 
equals b,, times the coefficient of /;* = py": +: p, po an fp. * Tp. WE use 
induction on r. For r= 1, this is ore what (2.22) and (2. 30) say. If 
r>1and /,=p;'!---p,"7', we assume the result for r— 1, and then we 
have (by (2.22) or (2.30) with /, replaced by /)/?): 


bi3p2%r = Pia (Coefficient of p,””” in f,,) 
= b,,: (coeff of Iz" in f, «++ fy,_,)(coeff of p,”” in f,,) 


by the induction assumption. But the product of these two coefficients is 
the coefficient of /;* in f,, ---f,,- This completes the proof. (Compare with 
the identity (5.8) for Hecke operators in §III.5.) Oo 


In a similar manner, one can derive identities for the g-expansion co- 
efficients a, for Ey, as /=[,/? varies over integers with fixed squarefree 
part /). We shall only give the result. One again proceeds one prime at a time. 
For odd p, the same identity holds as for Fi: 


A-1-s 


oO ae 1— X-1)41,(P)P 
XM P = Fo HI — pH a 


for fixed J), p?}/). But for p = 2 the identity turns out to be a little more 
complicated than for Fi. 
The most interesting Eisenstein series for '9(4) is the linear combination 
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Ay = (1 — 2a)(Eys + (1 + i)27*?F,,2) in Proposition 6. Let c, be the /-th 
q-expansion coefficient of Hy,. If /= op’, p? 1p, for an odd prime p, 
then, because of (2.30) and (2.32), we have the same identity for the c,: 


A-1-s 


: epee 1 = %~1)41,(P)P 
Sr pd — py 


(2.33) 


In the case p = 2, one has bj, 4y= 2" b,, and for a,4v one can derive 
formulas expressing a,,4v in terms of 5, 4v (these formulas are given in 
[Cohen 1973]). Combining these formulas for the case p = 2, one obtains 
the following result: if /) is a positive integer such that either (—1)*/, = 
1 mod 4 or else (— 1)*/, is four times an integer congruent to 2 or 3 mod 4, 
then the coefficients Chavo V= 0,1, 2,..., satisfy the same identity (2.33) 
with p = 2. Thus, under these circumstances the identity (2.33) holds for all 
primes p, including 2, in the case of Hy. 

The identities (2.33) for all primes p lead to a relation similar to (2.31), 
where either /, is squarefree and (— 1)*/, = 1 mod 4, or else /,/4 is squarefree 
and (—1)*/,/4=2 or 3 mod 4. It can also be shown (see the problems 
below) that c,; = 0 for / which are not square multiples of such /). Since the 
coefficients c,, in these cases are precisely the values L(y_1,4,,, 1 — A), we 
obtain the following proposition. 


Proposition 9. The element Ayr € My (To(4)) given by Ay, = C(1 — 2a) 
(Ey. + + i)2-*? F,) has q-expansion coefficients c, which can be de- 
termined by the identity 


es 1 = x14, P)p*** 
ep? = LY, 1-4 ee ee a 
2, Col = Meare LOT Goya pH 
where either |, is squarefree and (—1)*l, = 1 mod 4, or else 1,/4 is squarefree 
and (—1)*l)/4 = 2 or 3 mod 4. If | is not a square multiple of such an lp, 
then c, = 0. Finally, cy = €(1 — 24). 


(2.34) 


Thus, there are two very different elegant properties satisfied by the 
g-expansion coefficients of Hy,.. First, the coefficients corresponding to 
discriminants of quadratic fields are the values of the L-function for the 
corresponding quadratic character at the fixed negative integer | —A= 
(3 — k)/2. Second, for a fixed discriminant, the coefficients c, with / a square 
multiple of that discriminant satisfy an Euler product identity. 

Both properties are closely analogous to the results about forms of half 
integral weight which we shall need later to describe Tunnell’s work on 
the congruent number problem. The first property of Hj is parallel to 
Waldspurger’s theorem, asserting the existence of a modular form of half 
integral weight whose /)-th q-expansion coefficients are closely related to 
L(E,,, 1). The second property is an example of a very general correspon- 
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dence due to Shimura [1973a] between forms of half integral weight and 
forms of integral weight. 

Let us look again at the Euler product (2.34) for Hy.. The only part that 
depends on /, is the numerator 1 — X-1)41,¢ p)p* ‘~’. The remaining part 
of the Euler product 


I]d —p yl — pe2-syh, 
Pp 


depends only on H,,. and not on the choice of /). Observe that this is an 
Euler product we encountered before: it is the Euler product relating all 
the coefficients of the normalized Eisenstein series 


2 Bey ip By, 
Wea h et . eS i) 


(see Problem 16 in §III.3). We say that Fy2 € My2(F (4) corresponds to 
(— B,-1/(2(k — 1)))E,-, €M,-;(P) under the Shimura map. 

We shall discuss the Shimura map in more detail later. The Shimura map 
applies to forms of half integer weight which have Euler products similar 
to (2.34). As in the case of forms of integral weight, Euler products arise 
from the property of being an eigenform for Hecke operators. But when 
we work with half integral weight, it turns out that we only have Hecke 
operators 7,2 whose index is a perfect square; all other 7, are identically 
zero. This is why, in the case of half integral weight, we get a weaker type 
of Euler product, which only connects coefficients whose indices differ by 
a perfect square multiple. Hecke operators on forms of half integral weight 
are the subject of the next section. 


+ > 2 (n)q" 


PROBLEMS 


1. Verify the cusp condition for Ey., and find the value of Ey and Fy at all three 
cusps of T)(4). 


2. Derive (2.16) from (2.15). 
3. Derive (2.17). 


4. Check that the constants « and f given in the proof of Proposition 6 are the solutions 
to the two equations giving c\p, = L(zp, 1 — 4) in all cases. 


5. Suppose that we looked for a linear combination a£y. + BFjj2 whose /-th g-expansion 
coefficient for / squarefree is equal to L(x,-1)4,, 1 — 4). (That is, when (— 1)4/ = 2 or 
3 mod 4, this L-function value is c, rather than c,).) Show that no linear combination 
has this property for all such /. 


6. Show that if (—1)*/ = 2 or 3 mod 4, then c, = 0 in the g-expansion of H,,2. (Note. If 
Min (I'9(4)) denotes the subspace of My (T 0(4)) consisting of forms whose q-expan- 
sion coefficients satisfy this property, then it has been shown [Kohnen 1980] that 
the Shimura map gives an isomorphism of My3(T0(4)) with M,_,(T).) 
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7. What is the constant term in H,,.? What is the coefficient of the first power of g in 
Hy? When is this latter coefficient zero? 


8. Show that H.,. = 749° — £OF, and show how this gives Proposition 7. 


9. Show that for k >5 odd and for suitable a and b, we have O* — QE — bF yz 
€ Sy (T'9(4)). Find a and b. Express ©° and ©” in terms of Ey. and Fy. 
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We first recall how the Hecke operator 7, can be defined on forms of integral 
weight by means of double cosets. For simplicity, we review the case of the 
full modular group I. 

For n a positive integer and fe M,(T) we can define 7, fas follows. Let 
A" be the set of all 2 x 2—matrices with integer entries and determinant 
n. For any double coset Tal < A", where «€A", we define f|[ToI], = 
x f|Lay;],, where the sum is over all right cosets Pay; < Tal; equivalently, 
y; Tuns through a complete set of right coset representatives of T modulo 
a 1TaOT (see Proposition 41 in §III.5). Then 


TS en VAT], 


where the sum is over all double cosets of T in A”. 
There are not many double cosets of I in A”; in fact, if m is squarefree 
there is only one. More precisely, we have the following proposition. 


Proposition 10. A complete set of double coset representatives of T in A” is 
{a Say where ng, n, run through all positive UECETS such that n = non}. 
In particular, if n is squarefree, then A" = =TQ m0. On the other ges - 
n= p” is the square of a prime, then A? = es # oT Up’ (where p(t 


(pe pa))- 


Proor. Consider the abelian group Z* with standard basis generators e, = 
(4), e = (2). Any matrix «€ A" gives a subgroup of index n in Z’, denoted 
aZ7, namely, the span of we, and ae,, the columns of a. Conversely, any 
subgroup of index n in Z? can be obtained as «Z* for some a. Now fix any 
ae A". By the elementary divisor theorem (see, e.g., [Van der Waerden 1970, 
Vol. 2, p. 4]), there exists a basis e, e3 of Z* such that the subgroup «Z? 
is the span of n,e, and n,n,e, for some positive integers ny, n, with n= 
Non?. Let y, €T be the change of basis matrix from e,, ey to e}, e2, and let 
vai eT be the change of basis matrix from ae, mo to n,e}, 2yN,e>. Thus, 


ory;* ~~ =([ne1, n Noe] = nx yi an i.e., a= yn nyNo V2 EL (G n a) Since 
ae A" is arbitrary, this proves that the indicated double cosets exhaust A”. 
pecwaaee it is easy to see that these double cosets are disjoint: in fact, 


aeT(} nn = mT »,1 if and only if 7, is the greatest common divisor 
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of all entries in the matrix a. This is because, if « = y,«’y, with y,, er 
then d all entries of a’ implies dlall entries of «; conversely, because «’ 

71 ay", it follows that dlall entries of « implies d|all entries of a’. This 
completes the proof. oO 


In §III.5 we discussed the Hecke operators on forms of integral weight 
for the congruence subgroup I, (4). In that case one lets A” = A"(N, {1}, Z) 
(see (5.23) in §III.5), i.e., A” is the set of matrices of determinant n which 
are congruent to (} *) modulo N. For fe M,(I,(N)) one defines 


TARMOY AUN). 


where the sum is over all double cosets of ',(V) in A”. 


Proposition 11. /f g.c.d.(n, N) = 1, then a complete set of double coset repre- 
sentatives of T,(N) in A" = A"(N, {1}, Z) is (Gy, (es a aoe where Mo» ny are 
as in Proposition 10 and o,, is a fixed element of I’ such that o,, = zi nd 


modulo N. 


Proor. If «eA", we know by Proposition 10 that «= 7,71(5 °)y2, where 
¥1,72€I and n, is the greatest common divisor of the entries of «. We must 
show that « can be written in the form n,y{o,,(5 rota a yi, ET, (N), 
or equivalently, in the form 7,6, 7;(6 »,)72 With yj, = 10,,€1,(N). We 
suppose that n, = 1, i.e., m =m); the general case is onal similar. 


Lemma. /f g.c.d.(n, N) = 1, then a set {1;} of right coset representatives for 
Y modulo 1, (4) can be chosen so that t;€To(n). 


PROOF OF Lemma. Let t = (% %)eT be any coset representative. We look for 
(| 5)e1,(N) such that t’ = (% $)teT(n), in which case we merely replace 
t by v’. It is easy to see that there exist uw and v relatively prime such that n 
divides au+ cv. Let y=u+(j, +j,N)n, 6=v0+l, where j, and / are 
chosen so that u+j,;n=0 mod N, v +n =1mod N (this is possible be- 
cause g.c.d.(m, NV) = 1). If we show that j, can be chosen so that y and 6 
are relatively prime, then we can find a, B such that (7 §)e1,(N) and G te 
T')(”), because ya + 6c =au+cv=Omodn. But if u+j,1+j,Nn and 
v +n have a common factor, we first note that such a divisor must be 
prime to N (since v + /n = 1 mod N) and also prime to n (since g.c.d.(u, v) = 
1). Then if P is the product of all prime divisors of v + /n not dividing Nn, 
we can find j, such that uw + j,1 + j,Nn = 1 mod P. In this way we can find 
the required /,. 

We now return to the proof of the proposition. We have « = 7,(} °)y, 
with y,, y,€I-. Using the lemma, we write y, €T in the form Lie =i 7 where 
v1 €To(n) and y, ET, (4). Since yf eT o(n), it follows that (} °)"'y/(4 YeY. 
We write 
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, ae ve C °) 16 ° 

g= — 

71No » oe Aly 4) "1 9 Ya> 
-1.a 


where y; = (3 2) 17 ner. It remains to show that y3¢I,(N). But 
since «€ A” is congruent to (} *) modulo N, we have modulo N 


G6 96 Da( de 


from which it immediately follows that y3 ¢I',(). This completes the proof. 
Oo 


We now look at the analogous construction for forms of half integer 
weight. Recall that in that case we must work with the groups I,(4) ¢ 
G! <G, where 


b 
=4(a, o(2))|a=(“ ° \eGxi(Q), 
c ad 


(z)? = t(cz + d)/,/det « for some t = + ib, 
G' = {(a, o(z))eG|xeT}; 


r(4) = ft z))|a = : *e T,(4), j(@, z) = (5) eqi fez + a}. 


Suppose that 4|N and fe My(T ,(V)). Let n be any positive integer prime 
to N. In the case of integer weight we defined 7,, By, considering double 
cosets of the form I’, (N)ao, AG) mi (N), where n = ngn?. So for half integer 
weight one considers double cosets of the form f ON) Gn, Sn oli (N), where 
fn, €G is any lifting of (6 ?), Le., 6, =((d m)s init) for some t = +1, +2. 
Since ¢ would turn out only to affect our definition by the constant multiple 
t*, for simplicity we agree always to take r= 1. Also for simplicity we 
consider the case n, = 1, ny) =n; the general case is completely similar. 
So we now examine the action of the double coset P',(N)é,1', (NV) on f, where 


1 0 
= 4 
o=((0 ,) 9) 
That is, we compute 


AT MER Oye GE SEF ee (3.1) 


where the sum is over all distinct right cosets of I’, (N ) in our double coset, 
i.e., over a set {},} of right coset representatives of ',(N) modulo I” = 
&. if L(N)é, OT, (N) (see Proposition 41 in §II.5). 


Proposition 12. [fn is a positive integer prime to N which is not a perfect 
square, then f |[T\(N) E10 (N) Jaz = 9. 
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Proor. Given «€GL3(Q) and €=(a, d)eG, we construct a map from 
Vs aT (N)anl,(N) to T= {+1, +i} in the following way. Given 
y= OO ‘y,% with y, y,€T,(N), observe that # and €-'f,é€G, both have 
the same projection in I’, and so they differ by an element of the form 
(6 3), 8), ie., 
oF 6 = FC, 2. 
For fixed « and €, we consider the map that associates to y the number 7. 
One verifies (see the exercises below) that f(y) is a group homomorphism 
from I” to T, that it depends only on « and not on the choice of $(z) in 
= (a, p), and that in the case « = (} °) we have 


(y) = (4) for y= (: mer =a IP (N)anl,(N). (3.2) 


Let K < I” denote the kernel of this map ¢. 
pire the definition [’” = €'f ay )é, 11, (N). We claim that K =f’, 
, if ef, (N) is of the form &15,é, with y,e€T,(N), then rsh and 
eens To see this, first suppose that y, y,¢I,(N) and 7 = &1,é,. 
Applying the projection P: G > GL} (Q) gives y = (§ °)1y,(3 %), ie., ye” 
=a 'T,(N)anT,(N). Since &'F,é, = § = 7(I, 9, it follows that ye Ker t. 
Conversely, if yeK CT’, so that y=a'y,a and & 13,6, =F, 0) with 
t=1,it immediately follows that jeT”. 

Thus, in general, I” =K is smaller than f”, ie., the intersection of 
EP N)E, with I (NV) is a subgroup of the lifting of TM’ = «"T,(N)an 
I',(N). This subgroup I” is all of f’ if and only if the map ¢ is trivial. In 
our case t(y) = (4), so that ¢ is trivial if and only if 7 is a perfect square. (We 
are always assuming here that n is prime to N; the case n = p|N is treated 
in Problem 3 below.) If 7 is not a perfect square, then I” is a subgroup of 
index 2 in I’. In that case let P’ =I” Uf’t bea right coset decomposition ; 
thus, t= a" 't,a and t= &'4,6,-(1, —1). Let P(N) = (JI; be a right 
coset desemiposition of T,(N) modulo I’. Then 


Ph) =O Ury 
j j 
is a right coset decomposition of f',(N) modulo f” = €7'Ty(N)Eé, 0 F,(N). 
By the definition and Proposition 41 of §II.3, we have 
f\EF, (NEL (N) Iyo = VFS Flee + VSS tI liye: 
J J 
But for each j we have 
F Lent ye =f [Gt SE Fdue 
=f |[t,d, —1) 6,7 u2 
=/|(Cd, —N EF Jue 


because f is invariant under [7,],. for t,ef,(N). Since [(, —1)]y2 = 
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(—1)* = —1 by definition, we have 
SF \LenF we +f |[E7% Jue =f [EF ]y2 — f |\LEF]y2 = 0. 
This completes the proof of Proposition 12. Oo 


The same argument will show that f|[T,(N)6,,6,,11(N)]wo = 0 if no 
is a positive nonsquare integer prime to N. 

Because of Proposition 12, we can only work with a Hecke operator on 
My2(P(N )) of index prime to A if that index is a perfect square n? ; other- 
wise, the Hecke operator is identically zero. Recall that the building blocks 
for the Hecke operators in the integer weight case are the 7, for p a prime 
(see Proposition 32 of §III.5). Similarly, in the half integer weight case, the 
building blocks for the 7,2, g.c.d.(n, N) = 1, are the T,2, p N. 

So let us examine in detail the action of 7,2 on MyoT (N)), where T, 
is defined on fe My, (N)) in the following way: 


a = 1 0 
Tp f=P? FM éeF (N) jay, where €2 = (( oa ) 
(3.3) 


This definition is not quite the immediate analog of 7,2 acting on fe 
M,(1;(N)), which is given by p*?°(f|[T,(N)(G pn (N Ne + x(p)f) for 
fEM,(N, x), because A?’(N, {}OQ=T (N) (5 poly (N) UT, (N)o,(% oo 
T,(N) by Proposition 11, and f |[o. oy(8 ory (N)], = = x(p)f. Besides replacing 
k by k/2 and T',(N) and (4 >.) by their liftings P(N) and €,2, we also drop 
the trivial double coset T,(N)o,(% 9)P',(N). In the case of integer weight, 
when 7, and not 7,2 is the building block, 7,2 involves two double cosets ; 
but for half integer weight we shall agree to take only the nontrivial double 
coset. 

As in the case of integer weight, in studying MrT, (N)) it is convenient 
to decompose it into y-components. If x is any Dirichlet character modulo 
N, recall from §1 that 


Myo(Fo(N), Date M,2(P(N))| S| = x(a) f 


a b 
forall y= d ET (N) ?; 
c 


My(1(N)) = @, Myo(F oN), 2, 


where the sum is over all even Dirichlet characters modulo N (of course, 
Myo(To(N), x) = 0 if x is an odd character, since f= f|[—1]y2 = x(— Df). 
By an argument which is completely analogous to that in the integer 
weight case, one can show that 7,2 takes My; (N)) to itself; moreover 
(see Problem 5 below), it preserves the y-component. 
We now fix a Dirichlet character y modulo N, suppose that fe My (P(N), 


and that 
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y), and evaluate (3.3) explicitly. Our purpose is to express the Fourier 
coefficients for 7,2 fin terms of those for /- 

By the lemma in the proof of Proposition 43 in §III.5, we know that 
A?’(N, {1}, Z) is the disjoint union of the right cosets of T,(N) with repre- 
sentatives 


0 
h 2 0 
h=o(¢ ’) for O<h<p; and r= o,(" ') 


Here for n prime to N, a, denotes a fixed element of such that o, = (1 °) 
mod N. The trivial double coset is the right coset corresponding to Bp; 
thus, we have the disjoint union 


1 5b 
a =( a for 0<b<p?’; 


p?-1 


P(N) ¢ 2) FM) =U Tr) u U T,(N)B OT, (N)e. 


In order to evaluate (3.3) using the definition (3.1), we need the right coset 
decomposition of PN)E, 21',(N) in the form (a (N)é,2);- If we could 
write each «,, 8,, t in the form (¢ mys where vs ri ,(N), then the Y would be 
right coset representatives of T” A) modulo (5 ee 20 (N)(5 boa T,(N), 
and so the corresponding liftings ? would be right coset representatives of 
T,(N) modulo op aT, (NE, 201,(N) (see Problem 1(b) below). In that 
case, (3.3) is equal toZ SL 27 ]y2- 

Here we may adjust «,, B,, or t by multiplying on the left by any y’"'e 
I',(N)—this merely replaces one right coset representative of T',(N) in 
TN Nes por, (N) by another representative of ps same coset. In other 
words, it suffices to write «,, B,, t in the form y’(j mys where y, y’€IT',(N), 
and then compute 


Te S(2) =p? Y) FAI|[Sp2Flya- (3.4) 


The result will be the following proposition. We shall go through the 
detailed computation of 7,2 f after the statement of the proposition. 


Proposition 13. Suppose that 4|N, x is a Dirichlet character modulo N, p{N 


is a prime, and k = 24 + 1 is a positive odd integer. Let f(z) = XP-o Ge he 
My o(N), 2). Then 


T,2 f (2) = yy b,e27" 
n=0 
where 
—1)4 = 7 
by = Ayn + XP) (<u > "py *y + XCD?) P Any? (3.5) 


(here we take Gyp2 = 0 if p* }n). 
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Proor. As explained above, we must write each ,, f,, t in the form »’(j py 
with y, yer i(N). First, «, = (3 ») =(} PIG) >), i.e., for a, we can take 
=ly=(3 ). ne the contribution to GB. 4) from all of the «, is equal to 


voZ voll NG DD). 


= pir-2 S (2) or 
=F 5 dG *) 


p?-1 s 2 . 
2nin(z+byp? _ 0 if p fn > 

» € 2 ,2ninz/; 7 2 
prerninaiP’ if p?|n, 


Since 


b=0 


we see that this contribution is 2, a,2,e7""”. Thus, the «, give the first term 
on the right in (3.5). 

We next evaluate the contribution of t. Since g.c.d.(p”, N) = 1, we can 
find two integers u, v such that up* + vN = 1. Then we have 


iG :) p? —v\f1 0\(/p'u v 

T= O. = 0, ’ 

Vo 1) P\N wu /\o p2/\-N 1 

i.e., we take y’ = 0,2)” with y” = (% <="), and y = ee 1); then y’, ye, (4). 


We have 
S(2|LE27F7 ye = f(2| [6,29 E20 aya 


1 0 
= 71(p?)f(2| ace z))° ((( a ve) (JQ, »| 
Pp 2 


A simple computation of the product inside [ Jy2 gives ((% 9), p-"”). 
Thus, the contribution to (3.4) from t is equal to 


7 p? 0 _ z 
PX? *y(p*)f(2)| sp?) = p©?)2y(p?)pf (pz) 
0 1 2 
= p*7y(p?) Yarns, 


Hence, t gives the third term on the right in (3.5). 

Finally, we evaluate the contribution of the f,, 0<h <p. Again we 
want to write f, = o,(§ *) in the form o,) ae poy with y and o,y” in T’,(N). 
So we look for y = (¢ 8)eT,(N) and y”eI,(N) such that (§ 5) = 9" 
@ ashes 


led Gr hye. ap 
0 p/\c d 0p?) \0 p/\—-c_ ap? pee 


Clearly pla, so we write a = pa’, c = Ne’. Thus, we need —b/p + ha'/peZ. 
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So first choose any integer a’ prime to p such that a = pa’ = 1 (mod N). 
Then any (4 °)eI with a = pa’, c = Nc’ will be in T,(N). Next choose any 
b prime to a’ such that b=ha’ (mod p) (this is clearly possible). Since 
g.c.d.(Nb, pa’) = 1, we can find c’ and d such that dpa’ -- c’Nb = 1. Thus, 
¢ a) = (Ke ge, (), and 


D a ee ea 2) pa ,) 
ee —pe a ee d]’ 


which we denote "(5 py? As in the previous paragraph, where we evaluated 
the contribution of t, we obtain 


1 0 
S(2)|LE 2-7 aye = U(P)f(2)| Gace z)) . ((( 2) vr) “MIO, 2»| : 
k/2 


We compute 


(( —he (ah— wa) (=F) - =e Fa) 


—pc a a 


1 0 a b c\ 
(Gp) (CE Deere) 
_({pd—he (ah—b)p\ (=pc\-1 = 
=(( as a'p? \ & V~—cz +a 
({pa 6) (c\ - 
(ea) Qevera) 
—({P h C\ (TPC) .-1,-1 
(6 2) GF)e*") 
Since dpa’ — c’Nb = 1, we check that ¢,.€, = €,(4*)(— I)", and that 
(aa) = (5). Thus, 


OF MAOH) orn 
ae) 


since b = ha’ (mod p); but bc = —1 (mod p), and so we finally obtain 


(Gales), 


Thus, the contribution of all of the £, to (3.4) is equal to 


(k/2)—2 fee ph -1 =) 
D xp) d S(2) iG Oe (+ - 
= p¥2)-2 =) nS (4) ( *) 

Pp uo) (= & 2 - ‘i +7). 
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mh BV mie (hh 
oy (*s(: fs *) = 3 teu y (: )e e2tinh/p _ = €,,/p y (2 "a erring | 
a=1 \P P n=0 n=1 \P 


where we have made the usual change of variables (replacing nh by h) and 
used the value ¢ ox). for the Gauss sum. Thus, we obtain 


pi) “(= a gt ip > (3 ae e2ninz 


“ly(p) Gy 3 (5)e e2ninz 
P} n=o\P)" 


So the contribution to the n-th coefficient of 7,2 f(z) is p*"'x( p)(=*)a,. 
This is the middle term on the right in (3.5). The proof of Proposition 13 
is complete. Oo 


We note that it can be shown (see Problem 3 below) that the formula 
in Proposition 13 also holds when p|N, in which case y(p) = y(p?) = 0 
so we have simply 5, = a,2, 

We further note that, as in the case of integer weight, one can show that 
the different Hecke operators 7,2 commute; that Ty2n2 = Th2Tm2 when 
g.c.d.(m, n) = 1; and that T,2v is a polynomial in 7,2. Thus, the operators 
T,2 for different p generate the algebra of operators C[ {7,2}, ]. 

In the case of integer weight, we applied a formula analogous to (3.5) 
in the case when we have a modular form which happens to be an eigen- 
function for all of the Hecke operators. The result was a formula for the 
ratio of a, to a,, which can be written in the form La,n-* = a, - (Euler 
product). (See Propositions 36 and 40 in §III.5.) 

In the case of half integer weight, we can consider modular forms which 
happen to be eigenfunctions for all of the Hecke operators. But since only 
the 7,2 are nontrivial, we only obtain a formula for the ratio of a),12 to 
a), 1.e., we can relate coefficients whose indices differ by a perfect square 
factor. 

As in the integer weight case (see the end of §III.5), the spaces My. (Fo(N), 
7%) have a basis of eigenforms for all Hecke operators of index prime to N; 
and certain important subspaces have a basis of eigenforms for a// of the 
Hecke operators, i.e., for T2 when g.c.d.(n, N) = 1 and for 7, when p|N. 
Thus, let us now suppose that fe My2(To(N ), x) is an eigenform for all of 
the Hecke operators 7,2. 


Proposition 14. Let f(z) = 2%.) 4,e?""" € Myo(To(N), x) be an eigenform for 
all of the Hecke operators T,2. Let 4, be the corresponding eigenvalue, i.e., 
T,2f = A,f. Suppose that |, is not divisible by any square prime to N (i.e., 
p?|/y only if p|N). Then 
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_1\4 
fies x(p) ( : 4) pris 


a 2 s=a =5 =J-35° 
», ia oll Lip +a pe? 


(3.6) 


(Note: The use of the letter 1 to denote (kK — 1)/2 and its use with a sub- 
script to denote an eigenvalue should not cause confusion.) 


ProoF. If 7,2 f = Af with p} N, then (3.5) gives for any /, prime to p: 


Z = a-1 (- 1)*lo ‘ GB 7) 
p4igi? = 41,12 p2 +P” X(P) sage 41,12 3 . 
ha =a + A-1t ( y) (—1)%% a + k-2 ( Va 
pF igi? p2¥ = Aiji2p2t1) + PO XP ? Iglzp2¥ TP XP )Qi 12 p2v-1)5 
(3.8) 
VST De ess If p|N, then we need not assume that /, is prime to p; we have 


the same relations (3.7)—(3.8) in all cases, with only the first term on the 
right nonzero when p|N. 

On the other hand, if we look at the terms in (3.6) corresponding to all /, 
which differ by a power of p, 1.e., if we consider aj )2,2v(/,p") * for fixed /, 
prime to p (if p{ N) and variable v = 0, 1, 2, ..., and if we set ¥ = p™’, we 
find that (3.6) is formally equivalent to the following set of identities for all 


p and all /, prime to p (if p/N): 
4) 
z 1 — x(p) (Ue) px 
>» Up X" = An 1—A,X + x(p?)p?X? © 


But when we multiply both sides of (3.9) by the denominator | — 4,X + 
y(p?)p*- *X? and compare coefficients of Y’*!, we obtain (3.7) for v = 0 and 
(3.8) for v= 1, 2, .... Thus, (3.9) holds, and we have established (3.6). o 


(3.9) 


Proposition 14 explains the appearance of Euler products of the type we 
found in the last section (compare (3.6) with (2.31)). 

In the next section, we start by formulating Shimura’s theorem, which 
gives a deeper significance to the Euler product in Proposition 14. The Euler 
product (3.6) turns out to be closely related to a Euler product for a modular 
form of integral weight k — 1. 


PROBLEMS 


1. Let € = (a, $(z))eEG, let T’ < T(4) be a congruence subgroup, and let P” =T’n 
a 'T’a. For yel” define t(y) by the relation €7E~! = 7, - (1, t(y)), where y, = aya7!. 
(a) Show that the map ¢ depends only on « and not on the ¢(z) in &. Prove that ¢ is 

a group homomorphism from I” to T. 
(b) As usual, let ~ denote the lifting of an element or subgroup of I',(4) to G, ie., 


212 IV. Modular Forms of Half Integer Weight 


} = (7, /(, Z)) for yeTy(4), F = {j|yeI} for ’ <1,(4). Let K <I” be the 
kernel of ¢. Show that K =f’ n é-'T’€. Thus, if ¢ is trivial, then y+ 7 gives an 
isomorphism from I’’ tof” =I’ an €“!LE. 
(c) Prove that for feM,,(f’), if ¢ is nontrivial, then f|[T’EI’],,. = 0. Recall that 
APP Ay, ==,f|[E%]y2. where the sum is over all y; such that Pel’ = 
(J, P’€4; disjoint union). 
2. Prove that ¢(y) = 4) fora =(} 9, y= De (4) 001K (4a. 


3. (a) For’ =T1,(N), 4|N, and « = (§ 9), show that t is trivial if and only if 8|N. 
(b) For I’ =1,(N), 4p|N, and « = (§ 9), show that ¢ is trivial. 
(c) For I’, p, and « as in part (b), show that 7,v f(z) = Lay,’ for f(z) = 
La," € My, (P,(N)), v= 1, 2, .... If p= 2 and 8}N, show that this is still 
true for v even. In particular, Proposition 14 holds for p|N. 


4. Compare the formula for the q-expansion coefficients 5, for 7,2f when fe My. 
(I,(N), x) with the corresponding formula for T,2f when fe M,(N, x). 


5. We noted in the text that 7,,. takes My(T; (N)) to itself. Show that 7,. preserves 
My o(N), x) for any Dirichlet character x modulo N. 


§4. The theorems of Shimura, Waldspurger, Tunnell, 
and the congruent number problem 


We now state Shimura’s fundamental theorem giving a correspondence from 
forms of half integer weight k/2 to forms of (even) integer weight k — 1. 


Theorem (({Shimura 1973a]). Let k > 3 be an odd integer, 4 =(k — 1)/2, 
4|N, y be a Dirichlet character modulo N. Let f(z) = Up: a,e7""" € 
SioTy o(N), x) be an eigenform for T,2 for all primes p with corresponding 
eigenvalue A,: Tpf=4,f. Define a function g(z) = Xp, b,e°""" by the 
formal identity 


(4.1) 


= 1 

bn = — 
x Ty ~~ ApP "tte x(p)*p 
Then ge M,_,(N’, x?) for some integer N’ which is divisible by the conductor 
of x7. If k = 5, then g is a cusp form. 


k-2-2s° 


Notice that the definition (4.1) of the 5, is equivalent to the following: 
(1) 6, =1; 
(2) b, = A, for all primes p; 
(3) byv = A,bpv-1 — X(p)?p*7b,v-2 for v > 2; 
(4) Din = Om0, if m and n are relatively prime. 


In Shimura’s original theorem, the determination of the level NV’ of g was 
a little complicated. However, it has since been shown ([Niwa 1975]) that 
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one can always take N’ = N/2. It should also be noted that Shimura actually 
proved a somewhat more general theorem, applying to f which are not 
necessarily eigenforms for all of the T,2. 

As a simple numerical example, suppose N = 4, x is trivial. The first 
nonzero cusp form of half integer weight occurs when k = 9, 1 = 4 (see 
Problem 5 of §IV.1). Up to a constant multiple, it is f= OF(O* — 16F) = 
2Za,q", where we have chosen f so that a, = 1. (By Problem 17(h) of §III.3 
this f is also equal to y'(2z)/@%(z).) Clearly, f is an eigenform for all T,2, 
because So, (f')(4)) is one-dimensional. Then Shimura’s theorem holds with 
N’ = 2. Now S3(I9(2)) is one-dimensional and spanned by the normalized 
form g(z) = (n(z)n(2z))* = qT, (1 — 4")°(1 — q7")® = £b,q" (see Proposi- 
tions 19 and 20 in §III.3); hence this g(z) must be the g(z) in the theorem. It 
is now easy to relate the coefficients 5, of g to the coefficients a, of f. Namely, 
using (3.7) with /) =/, = 1, 1 =4, and noting that a, = 1, x’ = x (this is 
the trivial character mod 2, which equals 1 on odd numbers and 0 on even 
numbers), we obtain: 


b=A,=an+p? if p>2; b,=4a4. (4.2) 


While (4.2) follows immediately from Shimura’s theorem, it is nevertheless 
quite a remarkable numerical identity: the p-th coefficient in gII(1 — q")® 
(1 — q?”)8 is equal to p* plus the p?-th coefficient in g I (1 — q?")!2/(Zq")?! 
Like many numerical relations that follow from the theory of modular forms, 
this fact looks rather outlandish when stated in this elementary form without 
the theoretical context. 

If we have a fixed set of linearly independent forms f; in Sij2(To(N ay) 
which satisfy the hypotheses of Shimura’s theorem, then we can extend the 
Shimura map by linearity to the subspace of Seal, o(N), x) spanned by them. 
Note that the image g; of f, is always a normalized eigenform in S,_,(N/2, x7). 
If we take  .other set { f} of forms which satisfy Shimura’s theorem and 
are also a sis for the same space as the f; (for example, if we multiply each 
Jf, by a scalar), the Shimura map is clearly affected. When we refer to the image 
of a single eigenform under the Shimura map, we shall always mean the 
normalized eigenform g in Shimura’s theorem. But if we have a space of 
modular forms with fixed basis f; of eigenforms with Shimura( /;) = g;, then 
we define Shimura(2 a;f;) = Xa,g,, which is not necessarily a normalized 
eigenform. So our meaning of “image” or “preimage” under the Shimura 
map depends upon the context. 

In general, it is possible for several different fe Sya(Fo(N ), X) to go to the 
same ge M,_,(I)(N’), x’). For instance, there might be an f’ € Syy2 (To(N), x’) 
which corresponds to g, where x’ is a character different from 7 which has 
the same square: v7 = y’’. However, when N = 4, Kohnen [1980] described 
the situation more precisely. We now describe his main result. 

Let Mjj.(Fo(4)) denote the subspace of My2(10(4)) consisting of f(z) = 
2a,q" for which a, = 0 whenever (— 1)*n = 2 or 3 modulo 4. It is certainly 
a priori possible that there are no nonzero forms f with this property, which 
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requires that “half” its coefficients vanish. However, we know from Proposi- 
tion 9 that H,,;. is such a form. Also, it is easy to check (using Proposition 
17(a) of §III.3 and (1.8) of §1V.1) that ©(z) f(4z) € Myj2(T9(4)) for any 
fEMy-12(SL2(Z)). It turns out that M,5(P5(4)) is the direct sum of the 
one-dimensional space spanned by the Eisenstein series H,,. and the sub- 
space S,/,(F9(4)) of cusp forms: 


Syja(To(4)) = {f= Yang" € Syj2(Fo(4))|@, = 0 if (—1)*n = 2 or 3 mod 4}. 
(4.3) 


It is not hard to show that M,jo(To(4)) is preserved by all of the Hecke 
operators 7,2 except for T,. According to Shimura’s definition of T,, one 
has 7, 2a,g" = La,,g". If we look back at §1V.2, we see that F,,. is an 
eigenform for 7, (with eigenvalue 2‘~*, see (2.21)), but Hj). is not. In fact, 
it is easy to see that T Aygo ¢ Myjy(Fo(4)). For this reason, Kohnen modifies 
T,, and defines a slightly different operator Tj so that the m-th coefficient 
of Tj Za,q" for (—1)*m = 2 or 3 mod 4 is zero and for (—1)*m =0 or 1 
mod 4 is equal to 


Ge EX Ap le aah aig: 


With Shimura’s definition, since we have y(p) = 0 when p|N even if x is the 
trivial character on (Z/NZ)*, the second and third terms vanish in 


dam + i=in a te pao ais 


Thus, Kohnen’s modification is to replace the trivial character 7 by the map 
which takes the value | (and never 0) on all numbers including those not 
prime to N. It is 7; rather than 7, which preserves M,j,(f(4)) and 
Sii2(To(4)). Note that H,,2 is an eigenform for T,' with eigenvalue 1 + 2-2. 

Kohnen further shows that M;,;1(F (4) has a basis of eigenforms for all 
of the 7,2 (p # 2) and for T{ which is unique up to permutation of the 
elements and scalar multiplication. There is no obvious way to normalize 
an eigenform f= La,gq"; for example, we cannot necessarily multiply by a 
scalar to get a, = 1, since a, = 0 for all fe Myi3(To(4)) if 4 is odd. But one 
can require that the coefficients all lie in as small a field extension of Q as 
possible. 

It turns out that the images g of these eigen-basis forms fe Sw o(4)) 
under the correspondence in Shimura’s theorem are all contained in S,_,(T), 
YT = SL,(Z) (the results of Shimura--Niwa only guarantee that they are in 
S,—1(19(2))) ; they are all distinct ; and they form a basis for S,_, (I) consisting 
of normalized eigenforms for all of the Hecke operators 7, acting on S,_, (I). 
(In particular, dim Sij2(T9(4)) = dim S,_,(I).) Thus, if we take each of our 
basis elements for S;,/,(I'o(4)) to its image under the Shimura map—and take 
Hy. to the normalized Eisenstein series —/— E,.,-—and then extend by 
linearity to all of M,j,(Fo(4)). we obtain an isomorphism from M,j,(fo(4)) 
to M,_, (I) (and from S,),(1'9(4)) to S,_,(I)). This isomorphism commutes 
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with the Hecke operators, in the sense that: 


M,j(Fo(4)) af > gE M,-1 1) fr—g 
T,| \% and ri | | 
Tpfr Tg Tift Tg 


The basic method of Shimura’s proof of his theorem was to use Weil’s 
theorem which we discussed briefly at the end of §III.3. Weil’s theorem says 
that if £b,n~* and its “twists” 2b,W(n)n-* for certain Dirichlet characters 
w each satisfy the right type of functional equation relating the value at s 
to the value at k — 1 — s, then g = Db, q"eM,_,(T9(N’), x’). But the proof 
that all of these functional equations are satisfied is not easy; about twenty 
pages of [Shimura 1973a] are devoted to an investigation of delicate analytic 
properties of the Dirichlet series corresponding to g and its twists. 

Shimura’s correspondence seems rather roundabout. It says: take the 
q-expansion of a suitable fe Sy2(Po(N ), x); look at the g-expansion coeffi- 
cients a, as n varies over integers with fixed squarefree part /), and form a 
Dirichlet series from them which turns out to have an Euler product; then 
take the part of this Euler product which is independent of /), and expand it 
into a new Dirichlet series £5,n~*; and finally, go from this new Dirichlet 
series to the g-expansion £ 5,q", which will be your modular form of integral 
weight. 

After Shimura’s paper appeared, people started looking for a more 
conceptual, less roundabout construction of the Shimura correspondence. 
Certain more direct, analytic constructions were given by Shintani [1975] 
and Niwa [1975]. In addition, group representation theory was found to 
provide a conceptual explanation of this correspondence (see [Gelbart 1976], 
[Flicker 1980]). Moreover, the use of representation theory has led to 
striking new results about forms of half integer weight, especially in the 
work of J.-L. Waldspurger. 

Using representation theory, Waldspurger [1980, 1981] proved a remark- 
able theorem establishing a close connection between critical values of 
L-series for a modular form g of weight k — 1 €2Z and the coefficients in 
the g-expansion of a form f of half integer weight k/2 which corresponds to 
g under the Shimura map. Roughly speaking, the theorem says that the 
critical value is equal to the square of a corresponding q-expansion coeffi- 
cient times a nonzero factor which can be explicitly described. Waldspurger’s 
general result is complicated to state, so we shall only describe what it says 
in two particular situations. 

As mentioned before, Kohnen [1980] showed that the Shimura map gives 
an isomorphism 


Sija(To(4)) => S,-1 (1). (4.4) 
Here Sijo(Fo(4)) is defined in (4.3). Let g(z) = 2b,g"e€S,_,(I) be a nor- 
malized eigenform for all of the Hecke operators, and let yp be the character 
corresponding to the quadratic field of discriminant D. Suppose that 
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(—1)*D > 0, ie., the quadratic field is real if 2 = (k — 1)/2 is even and it is 
imaginary if A is odd. Recall that L,(zp, 5) denotes the analytic continuation 
of 21 Xp(n)b,n * (which can be shown to converge absolutely if Re s > 
k/2). Let f(z) = Lang” € Sy (To(4)) be the unique preimage of g under the 
Shimura map (4.4), i.e., g = Shimura(/). Let < f, f> and <g, g> denote the 
Petersson scalar products, where the same definition (see (5.31) in §III.5) is 
used for half integer weight as for integer weight, i.e., 


EP=—( |e 


Fo(4) 


|2y k/2 AXAY eidy: 
a 


where fo(4) is a fundamental domain for T,(4). 


Theorem ({Kohnen—Zagier 1981]). With the above notation and hypotheses, 


« VIPI <9. 9> 2 
Lito.9= (Bi) Qo eer (4.5) 


The basis of eigenforms fe Sip o(4)) can be chosen so that the g-expan- 
sion coefficients are all in some totally real number field. However, there is 
no natural way to normalize them: we can multiply each f by an arbitrary 
constant c in that field. But note that the right side of (4.5) remains unchanged 
when f is multiplied by c, since ajp and <f, f> are both multiplied by c?. 
So (4.5) does not depend on our choice of basis in defining the Shimura 
isomorphism (4.4). 

The L-series value in (4.5) is a “‘critical value” in the following sense. 
Recall that the Riemann zeta-function has a functional equation relating 
C(s) to €¢(1 — s), and the region 0 < Re s < 1 is called the “critical strip” for 
¢(s). Similarly, the Hasse—Weil L-function of the elliptic curve E = E, in 
Chapter IT has a functional equation relating L(E, s) to L(E, 2 —s), and 
the region 0 < Re s < 2iscalled its critical strip. The value of such a function 
at an integer in the critical strip is called a ‘‘critical value’’; in the case of 
L(E, s) the critical value is L(Z, 1). It is such critical values that have been 
found to have arithmetic significance. (A general context for the study of 
critical values is described in [Deligne 1979].) In the case of the L-functions 
for modular forms g of weight k — 1, it turns out that they have functional 
equations relating L,(y, 5) to L,(y, k — 1 — s). (Of course, y = % when we 
are working with quadratic characters y = yp.) Thus, the critical strip for 
L,(xp, 8) is O< Res <k—1; and the critical values are L,(7p,/) for 
J=1,2,...,k —2. The critical value L,(yp, 4) at j = 2 = (k — 1)/2 in (4.5) 
is the value taken at the exact center of the critical strip, i.e., at the fixed 
point under sok —1—s,. 

The first numerical example of the theorem of Kohnen—Zagier occurs 
when k = 13, 2 = 6, since S,,(IT) = CA is the first nonzero space of cusp 
forms for I. In this case g = A = i t(n)q", and f = OF(O* — 16F)(@* — 2F), 
where F=,o449,(”)q", O=ZXq" (see Problem 5(c) in §IV.1; this 
fe Siis2(To(4)) can be given other convenient expressions, for example, in 
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terms of © and £,). For more computational details of this example, see 
[Kohnen—Zagier 1981]. 

Our second example of Waldspurger’s theorem is the one studied by 
Tunnell for application to the congruent number problem. In S§III.3 we 
explained that the Hasse—Weil L-function L(E,, s) = 2b,n“S for the elliptic 
curve E,: y*? = x* — x corresponds to a cusp form £b,q" of weight two. 
In turns out that g(z) = £4,q" is in S,({(32)); g is a normalized eigenform 
for all of the Hecke operators, and, in fact, is the unique such “‘new-form’’, 
i.e., form which does not come from any lower level N < 32. 

For n squarefree, let D= —n if —n=1 mod 4, D= —4n if —n =2 or 
3 mod 4; that is, D is the discriminant of the imaginary quadratic field 
Q(./—n). In Chapter II we saw that L(E,, s) = Xyp(m)b,,m~ is a twisting 
of L(E,, 5). (Actually, in (5.7) of $11.5 we wrote x,(m) rather than xp(m); 
but 5,, = 0 unless m = 1 mod 4, in which case xp(m) = (3?) = (4) = x,(m), 
so one can equally well use either yp or y,. Because we will be looking at 
the critical value at 4 = 1, we want to work with quadratic characters of 
imaginary quadratic fields, i.e., (—1)*D = —D > 0.) 


Thus, we can write 
L(E,, 8) = L,(s) = b,m™*; 


L(E,, 8) = Ly(Xp; 8) = YX) bym™. 


We saw that the critical value L(E,, 1) = L,(yp, 1) vanishes if and only if n 
is a congruent number (‘‘only if’? here is conditional upon the Birch— 
Swinnerton-Dyer conjecture). It is this critical value which Waldspurger’s 
theorem provides a means of describing. 

Let B denote the “real period” of E, : y? = x3 — x, which is obtained by 
integrating dx/y over the segment [1, 00) where y is real: 


bax | 2 = 2.622: -., 


1 x” —xXx 


Theorem ([Tunnell 1983]). There exist a form f= Xa,q ™ESs (To (128)) 
and a form f = Zaiq™ € Sya(T (128), %2) such that Shimura(f) = 
Shimura(/’) = g = 26,,q™ and 


oer for n odd; 


4m 
Bb 
2 


In particular, L(E,,, 1) = 0 if and only if a, = 0 (n odd) or aj) = 0 (n even). 


Le h= (4.6) 


a,j for n even. 


Before discussing Tunnell’s explicit construction of the forms f and f’, 
we shall discuss how this theorem can be viewed as an analog of the results 
about Hj. which we proved in §IV.2 (see Proposition 6). 
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We saw that under the Shimura map Ay. = © c.g" € My (TF o(4)) corre- 
sponds to 


2(k — 

By-1 

In Problem 16 of §III.3, we saw that L,(s) = C(s)-¢(s — (kK — 2)) and that 
L(y, 8) = L(y, s) L(y, s — (k — 2)), where the L-functions on the right are 


Dirichlet L-functions. In particular, setting y= yp and s= 1 = (k — 1)/2, 
we obtain 


DE, = 562k) + DaagreM,(). 


Ly(xXp, 4) = Lp, LX, 1 — 4). (4.7) 


But, by the functional equation for L(yp, 5), we can rewrite L(yp, 4) in 
terms of L(zp, | — A) as we did in (2.15). We obtain an expression for L(yp, A) 
as a product of the form «- L(yp, 1 — 4), where * denotes a nonzero factor 
involving the gamma-function and powers of z. Substituting in (4.7) and 
using Proposition 6, we obtain 


L(x; A) Sr Cf): (4.8) 


When reformulated in this way, the results of §1V.2 are very similar to the 
previous two examples of Waldspurger’s theorem. As in the Kohnen—Zagier 
formula, on the left in (4.8) we have the value of the L-function for some 
géM,_, (I), twisted by xp, at the center of its critical strip; on the right we 
have the square of the corresponding q-expansion coefficient of the form in 
Mio (I'9(4)) which goes to g under the Shimura map. However, the Kohnen— 
Zagier theorem does not include this case, because g and f= Hy. are not 
cusp forms (in their formula, one cannot even define <f, f> and <g, g> 
except for cusp forms). The case (4.8) is not even included in Waldspurger’s 
general theorem, which also applies only to cusp forms. However, we may 
think of the results of §1V.2, which were proved in an elementary manner, 
as a “prototype” for theorems such as those of Waldspurger, Kohnen— 
Zagier, Tunnell. 

Recall that if we could take 2 = 1 in Proposition 6, then the coefficients 
C\p, = L(x, 9) of Hy). would be essentially the class numbers of imaginary 
quadratic fields Q(/D ). There is actually an analogy between these critical 
values and the critical values L(E,, 1) in (4.6). For elliptic curves, a role 
analogous to that of the ideal class group of Q(,/D) seems to be played by 
the so-called Tate—Shafarevich group UI. It is the order of IT which appears 
in Birch and Swinnerton-Dyer’s conjectural formula for L(Z,, 1) (or for 
lim,., (s — 1) "L(£,, s) if L(E,, 5) has an r-order zero at s = 1). Formulas 
for the “average” value of the order of I have been conjectured which are 
analogous to classical results in analytic number theory for the average value 
of the class number of imaginary quadratic fields. For more about this, 
see the papers by Goldfeld et al. [1979, 1982]. 

We now return to Tunnell’s theorem, and discuss Tunnell’s explicit con- 
struction of the modular forms f and /” of weight 3 whose n-th or (n/2)-th 
coefficient gives the value of L(E,, 1). 
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Tunnell’s first task is to find all fe S3,. (I )(N), %) whose image under the 
Shimura map is the modular form ge S,(I)(32)) corresponding to L(£,, s). 
According to [Niwa 1975], any such f has Shim(/)€S,(I9(N/2), x7). So 
one might try taking N = 64. However, there is no guarantee that Shimura(/) 
is not in S,(T9(N’), x?) for N’ a proper divisor of N/2. For example, we 
know that any fe Sio(To(4)) has Shimura(f) in S,_, (0) = S,-1(F(1)). So 
in fact the f we want could be in S3,.(I'9(N), x) for Na multiple of 64. Tunnell 
computed that, in fact, no f of level 64 maps to g, but that all preimages of g 
under the Shimura map have level 128. 

The character y must be even, since Sy2(Po(N ), x) = 0 for odd x; it must 
have conductor dividing 128; and y? must be trivial, i.e., y must be quadratic. 
There are two such y: the trivial character y = 1 and the character 7, defined 
by %2(/) = @) for j odd. Tunnell determined that Shimura“ ‘(g) consists of 
two forms in Sso(F 9(128)) and two forms in S3/2(To(128), %2). Moreover, 
he found that these four forms of weight 3 can be constructed in a par- 
ticularly simple way: by multiplying a certain form f, of weight | by forms 
of weight 4 of the type O(mz). 

Up to a constant multiple, @(mz) is equal to O|[(¢ 9), m*/*]1;2, and so it 
easily follows from Problem | in §IV.1 that O(mz)eM,, (1 9(4m), %m) (see 
also Proposition 17 in $11.3). If we multiply this by a form /, (z) € M, ('9(128), 
xy), then the product /,(z)@(mz) is contained in Myp(Fo(128), XX—-m) When 
4m| 128. (Recall that M, (T9(128), 7) = My,(F9(128), %° X-1) by Proposition 
3 of §IV.1, and so the character for /;(z)@O(mz) is y¥- ¥~1° Xm-) 

The form /, is chosen to be 


fi @ = Y (Sy, 


m,neZ 


It is an easy exercise to show that 
fi (2) = (OZ) — O(4z))(O(32z) — 7O(8z)), 


and therefore f, € My.(fo(16)) -M4j2(F (128), x2)  M,(T (128), x-2). Ac- 
tually, f, also vanishes at the cusps of I')(128), and so we have f, € S, (I)(128), 
%-2). Thus, for m|32 we have 


J, (2)O(mz) €S3(F(128), Yom): 


It can be shown, by the way, that f, can also be written as a product 
fi: = n(8z)n(16z) = gq] — 4"). — 9°"). 


For a short proof using the Jacobi triple product formula, see [Moreno 
1980]. 


Proposition ({[Tunnell 1983]). The modular forms f,(z)©(2z), f; (z)@(8z)€ 
Syo(Fo(128)) and f,(z)@(4z), f, (2) O(16z) € $3). (F'9(128), x2) are a maximal 
set of linearly independent eigenforms for all of the T,2 whose image under 
the Shimura map is the modular form g = %b,q" € S,(19(32)) corresponding 
to L(E,, s). 
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Tunnell then proves his theorem by a close examination of what Wald- 
spurger says in the very special circumstances of the congruent number 
problem. In our situation, Waldspurger’s theorem boils down to the as- 
sertions that: (1) there is a linear combination f= La,q” of the preimages 
in S3).(f'9(128)) such that for all odd squarefree /, 


L,(X-15s 1) = cay, 
for some constant c; (2) there is a linear combination f” = Zajq" of the 
preimages in S3,.(I)(128), x2) such that for all odd squarefree /, 
L(4-21,5 1) = Cai, 


for some constant c’. Tunnell computes that one can take f(z) = f;(z)O(2z), 


c= B/4Jiy, and f(z) =fi()O(4z), c =B/2V/2l. Since L,(x-,, 5) = 
L(E,, 5), this gives his theorem with 


f(z) = (O(2) — O(4z))(©(32z) — 7O(8z))@(2z), 
f'(2) = (O]) — O(4z))(O(32z) — 7O(8z)) O(4z). 


The numerical identity (4.6) in Tunnell’s theorem is quite bazaar. By the 
formula (6.8) in §I1.6, it says that for m odd and squarefree: 
° dx 


~ na —amj[n/8 ag? pe eee 
8nd (7) is . | ‘ 


1 Vx-x 

where L(E,, s) = Xb,,m~* and a, is the n-th g-expansion coefficient in (4.9) 
below! 

Note that in (4.6) we are interested only in the odd g-expansion coefficients 


of fand f’. But for n odd, the n-th coefficient is the same as the n-th coefficient 
in 


O(z) (o«26) “ 5006)) @(2z) = > | ee ae = 5 y gq? ty +82? 


x,y,zeZ x,y,zeZ 
(4.9) 
and 
Q(z) (ec25 = $00)) @(4z)= YL gtxtty+322? _ 1 ghrtr tse? 
2 x,y,zeZ OPTS 
(4.10) 
respectively. 


(Notice that the coefficient of g'e in (4.9) and (4.10) is obviously zero 
if /, = 5 or 7 mod 8 for (4.9), /) = 3 mod 4 for (4.10); but this tells us nothing 
new, since in Proposition 12 of §11.6 we saw that L(E,, 1) =0ifn=/, =5 
or 7 mod 8 orn = 2/, = 6 mod 8.) 

Collecting coefficients of g’° in (4.9) and (4.10), we conclude the version 
of Tunnell’s theorem that we cited at the beginning of Chapter I. 
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N 
> 
~< 


n is a congruent number 


“elementary” (proved 
in Chapter I) 


E,,: py? =x3 — n?x has 
infinitely many rational points 


aN 
i ee 


“7 [> Birch-Swinnerton-Dyer 
conjecture 
Coates-Wiles: 


LE,» 1) =0 


Theorems of Shimura, 
Waldspurger, Tunnell 


the n-th q-expansion coefficient in 
Tunnell’s product of theta-functions 
is zero 


Figure IV.1 
Theorem ([Tunnell 1983]). [fn is a squarefree and odd (respectively, even) 
positive integer and n is the area of a right triangle with rational sides, then 


#{x, y, ZEZ|n = 2x? + y? + 3227} =4# {x, y, zEZ|n = 2x? + y? + 827} 


respectively, 
# {x y,zeZ 5H 4x? + y? + x21) 
1 n 2 2 2 ; 
5 x,y,zEeZ za + y* + 8z*> |. 


If the weak Birch—Swinnerton-Dyer conjecture is true for the elliptic curves 
E,,: y? = x? —n*x, then, conversely, these equalities imply that n is a con- 
gruent number. 


In Fig. IV.1 we recall the logical structure of the argument. 
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As mentioned in Chapter I, Tunnell’s theorem has the practical value of 
leading to an effective and rapid algorithm for determining whether n is a 
congruent number. In addition, one can give quick new proofs of certain 
conditions for n not to be a congruent number. For example, if ” is a prime 
congruent to 3 modulo 8, Tunnell shows that a, = 2 mod 4, and therefore n 
is not a congruent number. For this and other corollaries, see Tunnell’s 
paper. 

The only sense in which Tunnell’s theorem is not yet a completely satis- 
factory solution to the ancient congruent number problem is that in one 
direction it is conditional upon the weak Birch—Swinnerton-Dyer conjecture 
for certain elliptic curves. But lately, significant progress has been made 
toward a proof of that conjecture in enough generality to include the curves 
E,. As mentioned in §II.6, in the mid-1980s Gross and Zagier were able to 
show that the weak Birch-Swinnerton-Dyer conjecture is true for E, for a large 
class of n (but one must be able to show that L’(E,, 1) # 0). At about the same 
time, R. Greenberg showed that, if the conjecture were to fail for an elliptic 
curve such as E,, which has complex multiplication, then that would imply a 
highly improbable combination of consequences for the Tate—Shafarevich 
group of the elliptic curve (see [Greenberg 1983]). 

The next few years saw a series of striking developments which, while not 
bearing directly on the congruent number problem, brought us closer to 
having a proof of the Birch-Swinnerton-Dyer conjecture. First, K. Rubin 
proved that if the group of rational points of an elliptic curve E with complex 
multiplication has rank at least 2, then its L-function L(E, s) has a zero at 
s = 1 of order at least 2 (see [Rubin 1987]). In combination, the results of 
Coates— Wiles, Gross—Zagier, and Rubin imply the following: for an elliptic 
curve with complex multiplication, if the order of zero L(E, s) at s = 1 is either 
0 or 1, then the order of zero is in fact equal to the rank of the group of rational 
points. Then Kolyvagin was able to strengthen this result dramatically, by 
proving that the same theorem is true for the much broader class of modular 
elliptic curves. If the Taniyama—Weil conjecture is true, then this class includes 
all elliptic curves defined over the rational numbers. See [Kolyvagin 1989 and 
1990] and [Rubin 1989]. (For a readable overview of results on the Birch— 
Swinnerton-Dyer conjecture, together with a discussion of the relation of 
this work to Gauss’ class number conjecture, see [Ireland and Rosen 1990, 
Ch. 20].) 

It is remarkable that the nearly complete solution that we now have to 
such an old and naive question as the congruent number problem, has 
required some of the most powerful and sophisticated tools from diverse 
branches of twentieth century mathematics. 


Answers, Hints, and References for 
Selected Exercises 


§.1 


1. See [Hardy and Wright 1960, pp. 190-191]. 3. (b) Follow the proof that 

x* + y* =u? is unsolvable on pp. 191-192 of Hardy and Wright. 4. For fixed n and 
fixed x (so that Z is fixed), the triples that correspond to x come from the 
intersection of the two conic sections X? + Y* = Z? and XY = 2n in the XY-plane. 
Given one point of intersection (X, Y), the other three are (— X, — Y), (Y, Y), and 
(— Y, —X), and so do not give a distinct triple. 5. (a) 1681/144; (b) 25/4; 

(c) 841/4, 1369/4. 7. Since x, y? = 0, | or 4 (mod 8), it follows that 2x? + y? + 82? 
can never equal an integer n = 5 or 7 (mod 8). The first congruent number n = 1 or 3 
(mod 8) is 41, which is the area of the right triangle with sides 6335, 134, 1444. 


§1.2 
1. replace y by y/n? and x by x/n 2. (c) x = —n¥/((X + Z), y = 2n?(X + Z) 
(e) X Y Zz | x y n¢ Y Zz x y 
3 4 5 —3 9 3 4 -5 12 —36 
4 3 5 —2 8 4 3. -5 18 —72 
—-3 -—4 5 12 36 —-3 -4 -5 -3 -9 
—-4 -3 5 18 72 —-4 -3 -5 —-2 —-8 


3. (a) If Z is the side opposite 6 and X, Y are the other two sides, the law of cosines 
gives X* + Y? — 2AXY = Z?. Then the point u = (¥ — AY)/Z, v = Y/Z is on the 
ellipse u? + B?v? = 1. Again use the slope of the line joining (—1, 0) to (u, v) to 
parametrize this ellipse. Show that u = (1 — B?1?)/(1 + B?2?), v = 2¢/(1 + B?t?). Now 
the area of the triangle is XY sin 6 = BYY, so we obtain: 

n=4BZ?(Y/Z)(X/Z) = 4BZ7v(u + Av) = BZ24(1 + 2At — B27) (1 + B22), 
Finally, set x = — Bt, y = (1 + B?2*)/Z, so that ny? = x(x? + 24x — 1). (b) Since 
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A=(1—A?)/(1 + 4?) and B = 24/(1 + 47), the right side of the cubic equation in 
292 
! Hx-t)=x0— 90+). 


part (a) becomes x { x? + 


§1.3 


1. Counterexample if K = F, = Z/pZ: F(x, y, z) =x? —x. 3. 1;0;2 5. (6) Here’s 
a counterexample if char K = p. Let d be a multiple of p, 

F(x, y, Z) =x? + y? + 244 x*y'z*, where 0 < a, b,c, a+b +c =d. Then all partial 
derivatives vanish at (0, 0, 1), (0, 1, 0), (1, 0, 0), but these points are not on the curve. 
In fact, if p = d= 3,a = b =c = 1, then the curve is smooth at all of its points, even 
though there are three points of PZ where all three partial derivatives vanish. 

(d) With K = R or C, think of F as a map F: K? > K, so that the smoothness 
condition becomes nonvanishing of the gradient. Then apply the chain rule to the 
composite function: x’y’z’-space se xyz-space 4,K. 6. (b) Reduce to the case 

Z; =Z, = 1, set Ax = 7 (x2 — x4), so that f’(x,)Ax = 7o7(V2 — Y1). Then 

0 = F(x; + Ax, yy + f(x Ax + a, Ax™ + +++, 1) = F(x, + Ax, yy + f'(x,)Ax, I + 
E(x, FAX, IY, FS (XR )AX, Ia Ax™ + +> = (1+ 88 F F(x, + x98, 91 + Yat, 

Z, +2Z,t) + (nonzero constant)r” + higher terms. 


§1.5 


2. (a) f(2) = (e?™4, e?*"*) for z= aw, + bw2, a, bER. (b) N*. (c) In part (a), let 

a = j{p, b = k/p; then for j, ke F, not both zero we have (j, k) <> subgp gen by 
(e27uilP, e?iKP) gives the required one-to-one correspondence; there are p + 1 
subgroups. 3. (a) If s = 2, reduce to the case when f(m, n) = 1 ifm,n=m,, ny 
(mod N), f(m, n) = —1 if m,n = m,, n, (mod N), f(m, n) = 0 otherwise (where m;, 
n, are fixed pairs); then pair together the jN + m,, kN + n, term and the jN + m2, 


kN + ny term. (b) a Y fim, noo? Gia, + faz; @1, ©). 
_ * O<m,n<N 


§1.6 


2. g3 — 2793. 3. —’(z) = 69(z) —4g2. 6 (8) = 79/9450. 7. 4/3, 8/27 

10. Set v(z) = f-1(g(2)), find (dv/dz)*, and show that dv/dz = +1. 

14. (a) Substitute ¢ = sin? 6 and integrate by parts. (b) Use Problem 12(b), and 
substitute x = (t — e,)/(e, — e;) to get the expression under the radical in the form 
x(x — 1)(x — 4). (Note: An elliptic curve written in the form y? = x(x — 1)(x — A) is 
said to be in “Legendre form”.) Then make the substitution ¢ = 1/x. (c) Expand 

(1 — At)! in a binomial series in part (b), and use part (a). 


§1.7 


2. (a) ((x? + n?)/2y)?. (c) let ord, denote the power of 2 dividing the numerator 
minus the power dividing the denominator ; dividing into the cases ord,x < ord,n, 
ord,x = ord,n, ord,x > ord n, determine ord,(x? + n?) and use y” = (x? — n”)x to 
find ord, y, in order to conclude that ord, y = ord,(x? + n’). (Of course, ord,n = 0 
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or 1, since n is squarefree.) 3. (a) 0 together with 3 x-intercepts. (b) points of 
inflection. (c) from each of the 3 x-intercepts there are 4 lines which are tangent to 
the curve at points of order 4. (d) if we have a configuration of three lines crossing 
three other lines, and if the elliptic curve passes through 8 of the 9 points where they 
cross, then it passes through the ninth. 4. (a) eight; setting y’ = 0 after twice 
differentiating y? = f(x) implicitly and then multiplying both sides by 2y? gives: 
(Qyy’)? = 2y2f'”(x), and hence f’(x)? — 2f(x) f(x) = 0. (b) x = tnv1 + 2,/3/3. 

5. four; draw lines from — Q which are tangent to the curve. 7. 4; 3; 8. The four 
points of order 4 not of order 2 are found by drawing lines from (n, 0) which are 
tangent to the curve. 8. two points of order 2 (at infinity and (a'?, 0)); three points 
of order 3 (at infinity and ((4a)"*, +(3a)”) if a is positive, (0, +(—a)"”) if ais 
negative) ; four points of order 4, namely, the two of order 2 and the two points 
(a1 + /3), +Ba(3 + 2/3))"”) (if a is positive; change the two +’s to — ifa is 
negative). 


§1.8 


aoe 16x* + Sie + 32g,x + 93 
16(4x° — gx — gs) 
2. f3(z) = 3x* — 3g.x? — 3g3x — 74693, with x = gz). 3. Look at zeros and poles 
of ¢0(Nz) — go(z); determine the constant — 1 by comparing coefficients of z~?, (See 
[Lang 1978b, pp. 34-35].) 4. Considered on the points tw, + £e, in the z-plane, 
o must take every such point either to itself or its negative (modulo L). When 
finding the matrix entries by looking at the cases (j, k) = (1, 0) or (0, 1), we first 
obtain (*} ,°), but consideration of other j, k shows that both signs must be the 
same. The analogous situation in cyclotomic fields is to set Q¥ = Q(cos 4f) (i.e., 
adjoin just the x-coordinate of the point of order N). Then Gal(Q,/Q,; ) is the 
subgroup {+1} in (Z/NZ)*. 5. The image is conjugated by the change of basis 
matrix. 6. (a) a subgroup of order 2, (b) the trivial subgroup, (c) the entire group, 
(d) a subgroup of order 2. (Note: In this and the next problem, these subgroups are 
only defined up to conjugation; see Problem 5.) 7. (a) 48, (b) Using Problem 4(b) 
of 81.7, we see that K; is generated by +./1 + 2,/3/3 and by the solutions y of 
y? = xn? (2,/3/3) = +22 V3 + 2/3. (©) By part (a), [K3: Q] divides 48. Since it is 
obtained by successive extraction of square roots, K3 has degree in fact dividing 16. 
On the other hand, by part (b), it is easy to see that K, contains i and also the fourth 
root of 4n?(3 + 2,/3), which satisfies x* — 24n?x* — 48n*, which is irreducible (by 
Eisenstein’s criterion for the prime 3, if 3 tn; if 3|n, a generalization of Eisenstein’s 
criterion can be used). Thus, the field F obtained by adjoining this root has degree 8. 
But F c R, while K; 37, so that [K,;: Q] > 16. Hence, the degree is 16, and the image 
of the galois group is a 2-Sylow subgroup of GL,(Z/3Z). Here are two alternate ways 
of showing that [K;: Q] is at least 16: (i) K3 has at least degree 2 over 


Q(V 2n/3 + 2,/3) < R, so it suffices to show irreducibility over © of the polynomial 
Tg choices or + (X $V A2nV3 + 2,/3). But otherwise a product of 4 of the roots would 
be in Z, and this can be ruled out directly. (ii) First show that 

[Q(/3 + 2,/3): Q] = 4, after which it suffices to show that aw QnJ/3 + 2,/3) # 
Q(V3 + 2./3). Otherwise we would obtain 2n/3 + 2,\/3 = (a + bv3 + 2,/3), 


, with x = ¢(z). 
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a, be Q(,/3); since V3 + 2,/3¢ Q(,/3), this gives 0 = a? + b7(3 + 2,/3), 

a contradiction, since a, bE R. (d) (§ {), ie., 2, = @,/3, 22 = (@, + @)/3. 8. (c) 
Show that a is an eigenvalue of a 2 x 2 matrix with integer entries. 9. (a) 3, (b) 4, 
(c) 6, (d) 7, (e) 8, (fF) 5, (g) 7. 


$1.9 
1. Possible conditions on p and f: 8| f; or else 2| fand p = —1 (mod 12). 
2. (a) p | 3 5 7 11 13 17 19 23 


type | (2,2) (2,4) (2,4) (2,2,3) 2,4 44 (22,5) (2,4, 3) 


(b) (4, 4) (c) (2, 2, 7), (d) (2, 4, 9), (e) (2, 2, 9, 37). For g = 17 and 9, you have to 
check that the x-coordinates of the doubles of all 12 points not of order 2 are 0, 1 or 
—1,1e., all points are of order 4. Note that those cases also follow from Problem 11 
below. 3. Same as Problem 2 except for: p = 13, (2, 2, 5); p = 17, (2, 2, 5). 

4. Notice that the right side x? — a runs through F, as x runs through F,, so the 
number of points is the same as on y? = x (plus the point at infinity). 5. See 
Problem 10 below. 


6. p | 5 7 11 13 iY 19 23 
type | (2,3) (2,2) 43) (22,3) 29 227) (8,3) 


7. Show that for all but finitely many primes p = —1 (mod 6) the group 
homomorphism from the subgroup of points of order m in E(Q) to E(F,) is an 
imbedding. Show that this implies that points of finite order can only come from 2 
points of order 2 and/or 3 points of order 3. Then find whether the points of order 2 
or 3 can have rational coordinates. 9. (a) Show that a point of order N is taken to 
another point of order N by the complex multiplication automorphism; but if both 
(x, y) and (—x, ,/— 1y) have coordinates in F,, then /—lefF,. 10. Proceed as in 
9(a), using the complex multiplication (x, y)H> (¢x, y), where ¢ is a nontrivial cube 
root of unity in F,2. 11. Suppose that « < f. Let G be the quotient group of the 
group of points of order /’ modulo the subgroup of points of order /*. Then 

G = Z/l’-*Z. Show that the complex multiplication used in 9(a) gives an 
automorphism of the group G whose square is the automorphism — 1 (which takes 
every element in Z//*~*Z to its negative). Show that there is no such automorphism 
if / = 3 (mod 4), because — 1 is not a square in (Z//Z)*. If / = 2, use the fact that —1 
is not a square in (Z/4Z)*. 


SII.1 


4. Z(T) =(1— T’)"'?; Z(T) = (1 — T’)"'; take the equation x? — a = 0 for a any 
quadratic nonresidue mod p. 5. Z(T) = 1/((1 — T)(1 — 2T)--- (1 — (M — 1)T)). 
6. (a) ZT) = 1/0 —- Ty -— TY Od - Tey. re ‘yy (b) The limit of 
the Z(7) in part (a) as M approaches infinity, an infinite product which is not a 
rational function. 

7. (b)-(c) Z(Pz,/F,; T) = 1/0 — T)U — 47) —@T)---(l—q™T). 9. Write V 
as a disjoint union of affine varieties, and use Problem 1. To reduce to the case of a 
single equation, use induction and the observation that the number of simultaneous 
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zeros of f = g = 0 is equal to the sum of the number for f= 0 and the number for 

g = 0 minus the number for fg =0. 10. I1/(1 — T)(1 — qT). 

1M. (1 — q7)/(1 — @?T)0 — 4°). 

12. 1/0 —T)\I-qT)d—-@TyYrd-—@T)—_¢qT). 13. Following Problem 13 
of §1.9, you quickly see that the field extension generated by the points of order part 
is contained in F,:™. It remains to show that only the /M-order points, and none of 
exact order /@*1, have coordinates in F,:“~'. Prove this by computing the exact power 
of / that divides N,m-1. Consider separately the cases (i) / remains prime in the 
quadratic extension Q(q); (ii) the ideal (/) = LL splits into a product of two prime 
ideals. Show that, for example, if L° is the highest power of L dividing « — 1, then 
L**/ is the highest power of L dividing «” — 1. In this way, use the fact that N, is 
exactly divisible by /? to prove that Njw-1 is exactly divisible by /?". 16. Write 
ZT) =1+¢,T+e,T? +++: = P(T)O(T), Q(T) = by + O;T +--+, where P(T) 
and Q(T) have no common factors and all coefficients are integers. Check that the 
polynomial Q(T) is primitive. Use the Euclidean algorithm to write PU + QV =m, 
where U, VeZ[T], meZ, m #0. Write m/Q = U(P/Q)+ V=d,+4,T+---. 
Since m = Q(T)(d) + d,T + ---+) with Q(T) primitive, it follows by the proof of 
Gauss’s lemma that m|d). In particular, m|do, and this means that the constant term 
bo is +1. It immediately follows that P(7) also has constant term +1. 


SII.2 


1. For example, to prove (3), in the double sum for J(z,, %2)9(%1X2), replace x by x/y 
and then replace ybyx+y. 4 i/3, \/5,i,/7,3. 5. 1+ 2i,3, -3+2i, -1+ 41. 
6. (4) J(y, 1) = (4x — 4x7) = Ex(1 — (2x — 1)?) = Ux — x’) 

(replacing 2x — 1 by x) = Z(1 + x2(y))x(1 — y) (where y = x? if y.(y) = 1), 

and this equals J(z3, %). 7. (1 + 3i,/3)/2, (—5 + 3i,/3)/2. 8. (a) Use part 

(b) with ma square root of I/n in F,2. (b) Replace x by x/m? and y by y/m3 

in y? = x>—n?x. 9. (a) Replace x by x/a. (b) Choose J to be the ideal of 

elements x for which ax € I, and take the sum over a fixed coset in (R/I)* 

of the subgroup consisting of elements congruent to 1 modulo J; show that 

each such sum vanishes. (Look at the example R = Z, / = (N) to get used to the 
argument.) (c) Check that g(y, ¥) = 9% W) = x(— lg, W). Then we have 

lg WI? = 9% Wa W) = Lacey Lyeny LOL) W(X — y) (here it makes no 
difference whether we sum over R// or (R//)*). Replace y by xy in the inner sum, 
thereby changing the summand to 7(y)W(x(1 — y)). For fixed x not prime to J, if we 
let J denote the ideal of elements whose product with x is in J (thus, J is strictly 
larger than /), we see that the inner sum vanishes by the argument in part (b). So we 
can replace the outer sum Ly ¢cajy+ DY Ly eR. We then interchange the order of 
summation to obtain: |9(x, ¥)|? = Lye XY) Zeer W(x — y)). When y = 1, the 
inner sum is N/. But when y + 1 it vanishes, since, by assumption, w is nontrivial on 
the subgroup (1 — y)R+///in R/J. 10-17. See Weil’s paper [Weil 1949] or else 
[Ireland and Rosen 1982, §4 in Chapter 11]. 20. (b) By part (a), modulo 3 we 
have J(z3. %3) = Exs Wa)? = EBNWF7(0) = ExeryWBx) = — 1. (©) Iza, xs) isan 
element of Z[w] of norm p which is congruent to — 1 mod 3. (e) Since J(73, x3) has 
norm p and the right congruence, it suffices to show that it is in the ideal (a + bw) 
(rather than the other possibility (a + b@)). Working mod a + bw, replace 73(x) by 
x-DB and y¥3(1 — x) by (1 — x)'?"? in the definition of J(x3, 73), and use the fact 
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that Xx/ = 0 unless p — | divides j. 21. (a) Use Problem 6 and the Hasse— 
Davenport relation. 22. (a) a #0 and char K # 3. (b) Use: the map x x? is 
one-to-one from K to itself, and so x? = wu always has exactly one solution. (c) There 
is one point at infinity. For any fixed y, the number of x such that x? = y? + ay is 

1 + 43(9? + ay) + ¥3(y? + ay). Sum this over ye K, making the change of variable 
y = ax, so that 73(y? + ay) = ¥3(@)xX3(* — x7) = %3(a)x3(x)x3(1 — x) (recall that 
—1=1in XK). (d) Compute the case r = 1 directly; then use the Hasse-Davenport 
relation. Z(C/F,; T) = (1 + 277)/((1 — T)(1 — 27). (e) Completing the square in the 
equation y* + y = x? and substituting yw = y +4, x’ =x givesy?=x +14, ie., 
(8y’)? = (4x)? + 16. Then set y = 8y’, x = 4x’. 23. (c) The following tables give the 
number of points in factored form. The type of the group follows from Problem 10 
of §II.1 unless that number is divisible by the square of a prime / = 1 (mod 4). Those 
cases are marked with an asterisk and discussed below. 


r Nyes N;" r N, r N, 

1 23 2? 2: 2° 10 25- 401-761 

3 23-13 2? -37 4 27-5 12* 27-5-13?-37-61 
5 23-401 27-761 6 25-13-37 14* 25 - 29? - 337-673 
7 23-29-3372? - 29-673 8  2°-3?-5-17 

(d) 

r yes NY r N, r N, 

I 2 27-5 2 25-5 8* 2° -3?-5?-73-97 
3 23-277 2?-5-109 4*  27.3?.5? 10* 25-5?-101-461 -3701 


523-101-461 2?-5?-3701* 6 2°-5-109-277 


To handle the asterisked cases, suppose that £,(F,) contains exactly / points of order 
I: jP,0 <j </; and that E,(F,r) contains exactly /? points of order /? of the form jQ, 
O<j<?. Let yc Far be the extension generated by the coordinates of Q. 
Following Problem 12 of §II.1, use the map o> o(Q) — Q on Gal(F,+’/F,) to show 
that r’ = /. Use this to show that the / part of the type is (/, /) in all cases except Nz” 
and N,o in part (d). In those two cases, show the type is (/*) as follows. If there were 
/? F,-points of order /, let F,r' be the extension of F, they generate. By suitably 
choosing a basis {P, Q} for the 2-dimensional F,-vector space of points of order /, get 
an injection of Gal(F,’/F,) into matrices of the form (§ 2) = F*. Thus, r’|/— 1. 


sI1.3 


4, By (3.11), we have g((5)) = Ea.b=0,1,....p-1 (op erm? =p — 1 + 

Der} Dpz} (*%)e?"!4”, In the inner sum replace b by ab to obtain g((5)) = 

p— 1+ (2bzf ePnlary api AH) = p— 1 — SEB) = 

2p — 1 — Epz3 (1 + 42%) = 2p — 1 — # fa, beF,|a? = 14+ 57} = 

2p —1— # {x, yeF,|xy = 1} (after the change of variables x = a+ b, y=a—b; 
a=(x + y)/2, b = (x — y)/2). Hence, gG)) = 2p —1-(p— 1) =p. 


sIl.4 


2. (b) Note that L(y, s) = 2-6 x(b) f(b), and use part (a) together with Problem 
9(a), (b) in $11.2. (c) L(y, 1) = — Ng) Zace X(@) log(1 — 7%). (d) L(y, 2) = 
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bg) Lace Kae). 3. (a) 2 e2x10 7H, (@) nM T(S)NELYY, 5) = 
mn O-92T(158)g(y) L(x, 1 — 5). (f) Set s = $ in part (e). (h) L’(y, 2k) = 
4(— Kk! 2 NAGE (kK + ZL, 2k + 1). 5. (a) Integrate by parts in the 
definition of the Fourier transform of f(x). (b) —iyt 7327-7, 
(e) mS*2T (SSL) NSL(y, 8) = —igQn 7-9? (235)L(%, 1 — 5). (a) For x(n) = (3) we 
obtain: L(y, —1) = (e727?) — Ke?"")), where I(x) is the function in Problem 
2(d). 7. Use part (a)(iii) of Problem 4. 8. (a) Add the expression for 
Ka, 1 — s) + 11 — a, 1 — 5) in Problem 3(c) to the expression for 
l(a, 1 — s) — (1 — a, 1 — 5) in Problem 5(d) to obtain: 

nS*12T(s/2) 


l(a, 1—s)= rd — ye s)+€(1 —a,5)) 


in SPT (5 + 1)/2) | 
(1 — s/2) ((a, s) — (1 — a, 5)). 


SIL.5 


2. (a) 6(t) = 40(1/t). (b) Let $(s) = [2 8(0(t) — 1) + J} (0) — 2). As in §IL4, 
show that $(s) is entire, and for Re s > 1 is equal to [2 S(O) — I) +3 +i5= 
L4 ho + 21 (8) Zo smez2|m|-7% = 4+ 7h + wT (8/40 (5). By substituting 1/t for ¢ 
in the integrals for o(s), show that (1 — s) = ¢(s), and hence 1™*T(s)¢x(5) = 

nm T(1 — s)€,(1 —s). Finally, show that 4¢q(s) = #5(#(s) — 4 — rhs) is analytic - 
except at s = 1, and that lim, ,, (s — 1)4¢x(s) = lim,., 7°/'() = x. 3. Since the 
Fourier transform of g(x) = e2% *e7™ +! is g(y) = Le2M™ OMe “mye”, One 
obtains: 6°(t) = te72"™""9",(2). 4. (a) (2miw- WFC). (b) CZ)HLe2RH MEMO, 

(c) 6,,4(t) = i *t-*-1 6"*(4). (d) Let a + bi run through a set of coset representatives 


modulo J = (n’), and write x in the sum in the form (a + bi) +n'(m, + mi), so that 


k 
n \- . tu): . 
the sum becomes n*|n’|~?*Xg45:4(@ + bi) Lez? ae (where u, + u,i= 


(a + bi){n’). The inner sum is essentially the Mellin transform of 0, ,(¢). The 
functional equation in part (c) will then give a linear combination of terms of the 
form {2 ¢**!~s9"*(t)4, and this linear combination can be expressed in terms of a 
Gauss sum and our original sum with s replaced by k + 1 — s. (e) Suppose the 

(09; 6;) for our Hecke character 7 is (k,, kz) (this pair of integers is called the 
“infinity type” of Z). Consider ¥ as a function on elements as well as ideals of Z[i] by 
defining 7(x) = Z((x)). Let k = |k, — k,|. If k, > kz, then the mapping 

xe x(x) = 209/x*(Nx)2 is easily seen to be a character of (Z[i|/f)*; if ky > k,, then 
x(x) = 7(x)/x*(Nx) is a character of (Z[i]/f)*. In, for example, the case k, > k,, 
the Hecke L-series is then LY acer LX)(Nx)§ = ZL x*7(x)(Nx)2*. 5. (a) Make 
the change of variables x’ = Mx to obtain g(y) = Jane 7" nae a 
jaamS(M*y), since M~*x’-y = x’ M*y. (b) L’ = M*2Z"; let g(x) = f(Mx). Then 
Let A(X) = Umez GM) = LmezrG(m) = amy Dyer S(Y) by part (a). 

6. (a) M* = rate ?); considered in C, L’ = 24Z[w]. Note that Tr xy = 2x-y, 
where on the left x and y are considered as elements of C, and on the right as 
elements of R?. (b) In Problem 5(b), let f(x) = el" g(x) = f(Mx) with M as in 
part (a); then apply Poisson summation. (c) Let 6(¢) be the sum on the left in part 
(b), and let o(s) = JR ya0°(0() — 1) + J3*3 COW — aay For Re s > | show that 
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$(s) = (2/./3)5(4 + z4,) + 21 (5)66,(5). The residue at s = 1 is 7/3,/3. Finally, 
replacing ¢ by 4/3: in the integrals for ¢(s) leads to the relation: $(s) = 
(2/,/3)?*"' @(1 — 5), and this leads to A(s) = A(1 — 5) for A(s) =  (4/3/2n)T(s)Cx(s). 
(d) Use the Euler product form ¢,(s) = TI,(1 — (NP)~)? For. a prime ideal P of 
norm p = | (mod 3), the contribution from P and P is (1 — p~‘)~?; for P =(p), 
where p = 2 (mod 3), the contribution is (1 — p-74)"! = (1 — p) "(1 — x(p)p™)"!; 
and for P = (,/—3) the contribution is (1 — 3~9)"'(1 — 13)3-)2 (since (3) = 0). 
So the Euler product is the Euler product for ¢(s) times the Euler product for L(y, s). 
(e) Multiplying the functional equations for ¢(s) and for L(y, 5) gives A(s) = A(1 — 5) 
for A(s) = 2 PT G)f(s)3/a)PT((s + D/2)LO, 5) = (V3/2)Cx(T PCs + 1)/2) = 
const - (,/3/27)*T'(s)Cx(s) by (4.4). 7. See Problems 20-22 in $11.2; (d) gy, W) = 3. 
8. (a) By part (1) of Proposition 9, g(y) equals e?"“’” times the Fourier transform of 
(x-w)e"™*l ; note that x-w = Mx-(1, i) with M as in Problem 6(a); then proceed 
as in Problem 6(b) to obtain g(y) = ry -(—a, Ie? %e7 473972. DI? (by Use 
Problem 7(c) to obtain ¢(s) = 37*"'n~*I'(s)6L(E, 5). (d) Replacing ¢ by + in the 
integrals in #(s), one obtains $(s) = ($)*"' Lx(a + ba) [F 1?-*0"(), where the 
summation is over 0 < a, b < 3; u = (a/3, b/3); and 

O"(t) =Z,ez2m-(—e, Yer™™e ul"? Then for Re (2 — s) > 3/2 use (4.6), 
Problem 2 of $11.2 (note that u-m = 3Tr((a + bw)(—@m, + m))/i,/3)), and the 
evaluation of the Gauss sum in Problem 7(d). The result is: (s) = 
(4/3) 32°72 — s)6L(E, 2 — 5). Equating this with the expression in part (b) and 
collecting terms gives the desired result. 


SII.6 


2. The function f(s) = A(1 + s) is even in the first case (so that its Taylor expansion 
at s = 0 has only even powers of s) and odd in the second case. 3. (c) Use: 


bye = 4 + 05 GH, + -+- + Hf, and 2a, =a, + G,. 


5-6. n | first few nonzero b, , L(E,, 1) | remainder estimate 
2|6,=1,6,=2,b,= —3 0.92707 |R,3| < 0.00027 
3| b, = 1,6, =2,6,;=6,b,,= —2 1.5138 |Ry5| < 0.00123 
10 | b, = 1, by = —3,6,;=6,6,7= —2 | 1.65 |Ry9| < 0.289 


7. You want |Ry+,| to be less than c/2, i.e., c/2 > 4(1 — eT) “be MHLW for 
large n the right side is asympotic to 4./N’e7™*™’, 50 choose 

M > 1\/N’ log(8./N’/nc) © 1/N’ log n, i.e. (2n,/2/n)log n for n odd, (2n/n)log n for 
neven. 8. (a) Use Problem 3 to find the 4,,,,,. (b) See Problem 7 of §I.1. 


SII.1 


1. Tel, (N) <I,(N), but STS~' ¢T,(N); hence, neither I, (N) nor T'o(4) is normal 
inl. 2. (a) Given Ae SL,(Z/NZ), let A be any matrix with 4 = A mod N. Find 

B, CESL,(Z) such that BAC is diagonal: BAC = (4 9), where ad = det A = 1 

(mod N). It suffices to find 4 = (74% 2%) with determinant 1, since then 

BAC €SL,(Z), and BAC = B"'(4 °)\C-! = A = A(mod N). To find A so 
that 1 = det A= 14+ ((ad— 1)/N + xd)N — zN°, first find x so that 

xd = —(ad — 1)/N (mod N), and then choose z so as to make det 4 = 1. 
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3. (a) (q? — 1)(q? — 9); (b) q(q? — 1). 4. (b) Since the kernel in part (a) has p**~”) 
elements and #GL,(Z/pZ) = (p? — 1)(p? — p), it follows that #GL,(Z/p*Z) = 
p*?-3(p? — 1)(p — 1). (c) Divide the answer in (b) by $(p*) to get p**-7(p? — 1) 

5. Use the Chinese remainder theorem. 6. N3II,iy(1 —p~”). 

7. N3Uyiy(1 — po2); Ns NU yy — po); N20 iv — po); NTT yiw(l +p) 

8. The image of I'(N) under conjugation by (% ~3) is the subgroup of I'y(N) 
consisting of matrices whose upper-left entry is = 1 (mod N). 10. and 14(b). See 
Figures A.1-A.2. 12. Besides I and I’(2), there are four, namely, the preimages of 
the three subgroups of order 2 in S; and the one subgroup of order 3 under the map 
I > SL,(Z/2Z) = S3. (i) To(2) = (3 *) mod 2}, Fo(2) = the right half of F(2); 

(ii) 1°(2) = {(& 2) mod 2}, F°(2) = FU T7'Fu SF; (iii) G2) = { ¢>)=TJorS 

mod 2}, fundamental domain = FU T7! FU T“'SF; (iv) {(@ 5) =F, ST or (ST) 
mod 2}, fundamental domain = FUT™'F. 13. (a) This can be proved using the 
fundamental domain, as in the proof of Proposition 4 in the text. Here is another 
method. Let G denote the subgroup of 6(2) generated by +S, T*. Clearly G c G(2). 
Conversely, write g € G(2) as a word of the form +S“ 7°:ST* --- ST®!, where 

a, =Oor land b;#0,j=1, ...,/— 1. Use induction on / to show that geG. We 
work mod +/, so that S? = (ST)? = 1. Without loss of generality we may suppose 
a, = 0, b, #0, since we can always multiply g on the left or right by S without 
affecting whether ge G. For the same reason we may suppose that 5, = 5, = 1, since 
T’€G. Note that / = 1 or 2 is impossible, since T, TST ¢ G(2). If / > 2, write 

g = TST” ---. Since (STS)(TST) = 1, we have TST = (STS)"' = ST~'S, and so 
T?Sg = TST*2"'§ ---, which is just like g but with b, replaced by b, — 1. If b, > 0, 
use induction on b, to finish the proof. If b, < 0, write ST~?g = ST 1ST” --. = 
TST*2*! ..., and again use induction on |6,|. (b) Use: P°(2) = TG(2)T™*, 

T9(2) = STG(2)(ST)'. (c) Let G be the subgroup of F generated by T? and ST~*S. 
Since G < F'(2), it suffices to show that [[': G] = 6, e.g., that any “‘word” in S and T 
can be multiplied on the left by elements of G to obtain one of the elements a; used in 
the text for coset representatives. Use S? = (ST)? = | and induction as in part (a). 
(d) Use part (c) and the isomorphism in Problem 8. 14. (c) Fo() is bounded by the 
vertical lines above 4 and —4; arcs of circles with diameter [0, 2] and 

[—2/(2p — 1), 0]; and arcs of circles with diameter [—1/(k — 2), —1/k], 


(The centers of the 
boundary arcs are at 
—1/2, 0, 1/6, 4/10.) 


x 
0 1/2 


Figure A.1 
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Figure A.2. Two possible fundamental domains for I',(4): 


I. Let F(2) be the fundamental domain for I'(2) in the text. Then aF(2), where 
a =( ~), is a fundamental domain for I')(4) = aI (2)a7! (see Fig. A-1). 

Il. UR a; F is a fundamental domain for I')(4), where F is the fundamental domain 
for T = SL,(Z) given in the text, and a, are coset representatives for T modulo 
T,(4). Here we have taken: a, = 1,a, = S,a;= 7 'S,a,=T ?S,«,= T'S, 
%, = STS. (In Fig. A.2, the number j labels a; 'F; the solid boundary arcs are 
centered at 1, —4, —3; the dotted arcs are centered at 0, —1, —4, —2, —4; 
the point P is (—7 + i,/3)/26.) 


k =3,...,p. For example, Fo(3) is the union of the regions marked 1, 2, 3 and 4 in 
Figure A.2. 15. See Problems 12 and 14(c) for counterexamples. 16. (b) (i) 2; 
(ii) 2; (iti) 2; (iv) 1. 


SIIL.2 


1. In the sum for G,, group together indices m, n with given g.c.d. 2. Substitute 
z= iin Proposition 7 to obtain: £,(i) = 3/n. 3. (a) Since both sides of each 
equality are in a one-dimensional M,(I), by Proposition 9(c), it suffices to check 
equality of constant terms. (b) Equate coefficients of g” on both sides of the 
equalities in part (a) to obtain: 0;(m) = 03(n) + 120 2°71 03(j)o3(n — J); 

I1og(n) = —1003(n) + 210,(n) + 5040 X82} o3(/)os(n — Jf); 

3, 3(n) = 21a,(n) — 200,(n) + 10080 X72 o5(j)o,(n —j). 4. (a) Since the left side is 
in S,,(I), by Proposition 9(d) it suffices to check equality of coefficients of q. 

(b)t(n) = $8504 1m) + $2ba5(n) — 834 E27} 06(j)o5(n — jf). (c) Consider part (b) 
modulo 691. 5. Let F(X, Y) be a homogeneous irreducible polynomial satisfied by 
X = E,, Y = Eg. Substituting z = iin E,(z), E¢(z) leads to a contradiction, since 
E,(i) = 0, E4(i) #0. 6. (b) Use part (a) with x = e~?" = e?*, along with the 
relation 0 = x@3p Basi Eas (i) = a@en Bart — DX, o,(n)g" with q = e?™. 7. Use 
Proposition 7 and the derivative of the identity f(—1/z) = z*f(z). 8. (a) By Problem 
7, the right sides are in M,(I-) and M,(I), respectively. Now proceed as in Problem 
3(a). (b) 2165(”) = 10032 — 1)o3(n) + o(n) + 240 D92} 0, (JJog(n — Jf); 

206,(n) = (42n — 21)05(n) — o,(n) + 504272} 0, (j/)os(n — j). 9. (a) Use the fact 
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that n? = 1 (mod 24) if n is prime to 12. (b) Use Proposition 9(d) to show that their 
24-th powers are equal. (c) 12, (1 — 9") = Zya 44 moa 12 9 24 — 

Dyes moar 724, 10. f = 2564-7 + 14 A724 271)? j = 1728 if A = 1; if 
2 = a/b in lowest terms (with a and b positive) and je Z, that means that 

a‘ b*(a? + b)? divides 256(a* + a*b? + b*)?, but since a, b and a? + b? each have no 
common factor with a* + a7b? + b*, that means that a?b?(a? + b?) divides 16, and 
now it’s easy to eliminate all possibilities except a= b = 1. 


SIII.3 


4. (a) Since [9(V) = +T, (1) in those cases, there is no difference between 
T)(4V)-equivalence and I, (V)-equivalence. (b) — 1/2 is an irregular cusp for I'(4). 


5. Group | To(p) | To(p’) | P(2) 
Cusp coo | 0 | 0 | —I/kp, k=1,...,p—1] ©} 0 |} -1 
Index 1 | pj 14 p? 1 2.42 2 


7. (a) See the proof of Proposition 18; replace a by X in (3.11). 8. (a) Replacing z 
by z + 1/2 in Ze?""" gives E(— 1)"e2"?"" ; meanwhile, the right-hand side is 

2D e272" (bh) works the same way. (c) On the left side, the constant term is clearly 
zero ; the coefficient of q” for pfn is (—2k/B,)o,-,(n); if pln but p/n, then the 
coefficient is —2k/B, times o,_,(n) — (1 + p*~')o,_, (n/p) = 0; if 2 = p™ny with m > 1 
and p/no, then the coefficient of q" is 

— 2k/B, times o,_;(p"No) — (1 + p*?)ox-1(p™ *No) + p** oy-1(p™ 7 No) = 

O41 (Mo) (O,—-1(P™) — CL + ph) (Pp *') + po, (p™ 7)) = 0. 

(d) Use parts (b) and (c) with p = k = 2. (e) Rearrange the infinite product of 

(1 — e482)" = (1 — (—1)"q") by writing 1 + gq" = (1 — q?")/(1 — 9”), getting 

Thy even’ Un twiceanoda/noaa (Where II denotes II(1 — g")). But this equals 

18 keer : I, even/Tann : Tl, twice aneven — 173 ,./T, : Taj,- 9. See the proof of Proposition 
30. 10. (a) Since 7°(z)n8(2z) € Sg(I'9(2)) by Proposition 20, to show invariance of 
n° (4z)/n*(2z) under [y], for yeT,(4) it suffices to show invariance of 
n°(4z)9°(z)n*(2z) under [y],o for y€I'9(4); now use Problem 9 to show this. At the 
cusp 00, we see that 7° (4z)/n*(2z) = qld. — q*")®/(1 — q?")* has a first order zero; to 
find 4(0) we apply [S],: 2 74°(—4/z)/n*(—2/z) = 

2*(2/4)*n(z/4)(—(z/2)?n*(z/2)) = — eel — 93)8/(1 — 93")*, which approaches 
—¢a as z—> 00. To find the value at the cusp —4, which is equivalent to the cusp 

4 = T(—1/2), we can apply [(3 9)]9, as follows: 


s( 42 2z+1\* ,f 1 1 
n WSN ge = 
Q2 + 1)? 2z+1 4z 4z 2 


=(2 1)? 
oe yn eea ey ae 
NAA ae ie ee 
—1 
24 
m(Z 
: 2 
= a = e- 28s ; = ; : ; by Problem 
Z ot (Ge )at (3) emeent(Z) 8(e) 
Z Zz 2z 
1 _9?°(2z) 


~ 16 n8(z)n8(42)’ 
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which approaches 7g as z > 00. 
(b) E,(ST Sz) = E,(S(—a — 2)) = (a + 2)°E,(—a — 2) + $(—a — 4), but 
E,(—a — 3) = E,(—4), so this equals (a + 4)*(z?E,(z) + ¥) + $(-a—}) = 
(az + 1)°E,(z) — (az + 1). (c) Obviously F|[T], = F. Using Problems 1 and 10(b), 
we have —24F(z)|[ST~*S], = E,(z)|[ST~*S], — 3E,(2z)|[ST~?S], + 
2E>(42)|[ST1S], = E,(z) + 24 ata — 3(E,(22) + Bray) + 2(E,(42) + Sade) = 
— 24F(z). We now look at the cusps. First, we clearly have F(oo) = 0. In evaluating 
F\[S],, ignore terms which approach zero as z > 00; then, using Proposition 7, 
obtain F(z)|[S]_ ~ —sE,(z) — 3(¢E,(2/2)) + 2G5E (2/4) > —(1 — 3+ %) = 
—¢z. This is F(0). To find F(—4), we similarly ignore terms which approach zero: 
—24F(z)|[ST~?S], = E,(z)|[ST~?S], — 3E,(2z)|[ST 1S], + 2(22 + 1)7? (#45) 
by Problem | above, and, by Proposition 7 and part (b), neglecting terms which 
approach zero, this becomes £,(z) — 3E,(2z) + 2(2z + 1)-?(-4 — }° E,(-4-)= 
E,(z) — 3E,(2z) + @z ?(—E,(— 42) + 6E,(—3z) — 4E,(—2)) by Problem 8(d); again 
applying Proposition 7 and neglecting small terms, we obtain 
E,(z) — 3E2(2z) + 4(— 16E,(4z) + 24E,(2z) — 4E,(z)) 91 —3+4(-16+ 24-4) = 
—3, so that F(—4) = —34(—3) = %. (d) Since the only zero of n°(4z)/n*(2z) is a 
simple zero at 00, it follows by Proposition 18 that F(z)/(q°(4z)/n*(2z)) € Mo(T9(4)) is 
a constant. To get the identity, write (1 — g*”)/(1 — q*") = (1 + q?”). (e) First show 
that IIo! A(z + j/N) €S;2y(To(N7)), using the same type of argument as in the 
proof of Proposition 17(b). Then set f(z) = A(z)*/MA(z + j/N), and take the 
logarithmic derivative of the equality f(yz) = f(z) for yeT,(N7). 11. (a) Prove 
invariance under [7], for yeI(4) as in Proposition 30. At the cusps, clearly 
©*(co) = 1; at 0 we have ©*(z)|[S], = z27-?@*(—1/(42/4)) = 27 2(—z?/4)O4(z/4), 
which approaches — 4 as z > 00; at $ we have ©*(z)|[ST~?S], = 
(2z + 1)~?O*(z/(2z + :1)) = (2z + 1)? 04*(-1/(4(— 1/42 — 1/2))) = 
—(2z)"?©*(—1/4z — 1/2) by (3.4), but by Problem 8(a) this equals 
— (2z)-?(20(— 1/z) — O(— 1/42))* = — 4221)? @(Z/4) — V2/iO@)* = 
(©(z/4) — ©(z))*, which approaches zero as z — 00. (b) Use: ©*(00) + 0 = F(00). 
(c) One can proceed directly, as in Problem 10(a). Alternately, write it as 

A(2z)__ n*(2z) 
(n(z)n(22))* (42) 
easier method is as follows. Let f,(z) = 9*(2z)/n8(4z), and let f,(z) = 
?°(2z)/n*(z)n*(4z). In the solution to Problem 10(a), we saw that for 
a= ST~?S = —(5 }) we have f, = 16f,|[o],. Since /, is invariant under [y], for 
yeT (4), it follows that /, is invariant under « 'T'9(4)«, which is easily checked to be 
equal to T,(4); finally, since « keeps 0 fixed and interchanges the equivalence classes 
of cusps co and —4, we see that /,(0) = 16/,(0) = —4, ,(—4) = 16f,(«~) = 0, 
t2() = 16f;(—4) = 1. (d) Same procedure as 10(d). (e) Use Problem 8(e). 
12. Follow the proof of Proposition 19. 13. Write (4(z)n(3z))® = 
A(z) (n(3z)/n3(z))°, (n(z)n(7z))> = A(z) (n(7z)/n7(z))°.. 14. Check the generators T? 
and S, using (3.4) to get #(Sz) = ./z/i¢(z); to check the cusp —1 = (TST)! 0, 
write $*(z)|[TST], = O*(z/2)|[G ?)]2 (see Problem 1). 
15. (a) For y = (@ 2)eET(N) note that ayyay' = (_4, ~2%)eTo(N), and so 
(Calera) ale = (f [Low vee” Je) [Loew Ji = xa) f \Loew Je: But x(a) = x(a). 
(b) For fe M,(N, x) write f= f* +f~, where f* = 3( f+ i*f|[ay],). (©) By 10(d), 
F(z)|[aa]2 = —i6n*(2)/n*(2z) = —¥e + 39+ -+- = —7{¢O* + F; matrix is 
Ch 1%): MF(4, 1) = CLO*, Mz (4, 1) = CGF — 40%). 
16. (a) Loy) = Liman im) > = Ej PLM = Os + 1 — ks). 


and use Propositions 17 and 20 and Problem 10(a). An even 
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SC SB 


Figure A.4 


(b) L,(s) = T1,[0 — pl — p49) = 1, = opp + phy. 
(c) Ly (s) = ,[0 — x(p)p*) — x(p)ps**)\* 

© = T= xp) (p)p* + py?" 
17. (a) Any point '-equivalent to i must be Iy(4)-equivalent to one of the points 
a; 'i, where F = ()?_, a1o(4). In this way, find that i, (2 + #)/SeInt F’ are 
T’-equivalent to i; w and (5 + i,/3)/14e Int F’ are I-equivalent to w; the two 
T9(4)-equivalent boundary points (— 1 + #)/2 and (3 + i)/10 are I’-equivalent to /; 
and no boundary points are '-equivalent to w. (b) Follow the proof of Proposition 8, 
but with slightly more involved computations. Note that the three “‘corners”’ 
(—3 + i73)/4, (1 + 1/3)/4, 9 + i/3)/28 are all [)(4)-equivalent, and the sum of the 
angles at the three corners is 360°. To illustrate the elements in the integration 
around the cusps and along the circular arcs, let us compute (27/ Sf @azif2) 
over the contour ABCDEF pictured in Fig. A-3, where we suppose that f(z) has no 
zeros or poles on the contour (but may have a zero or pole at the cusp 0), and we 
ultimately want the limit as ¢ > 0. Here BC is a circular arc of radius ¢ centered at 0. 
The element a, = (} ?)€I(4) takes the arc from A to 0 to the arc from D to 0. The 
image of B is very close to C, and so we can use the integral from D to «, B to 
approximate the integral from D to C. First, we use S to take the arc BC toa 
horizontal line between Re z = —3 and Re z = 1 (see Fig. A.4). By (2.24), we have 


“f@Q4 =| “ee -*[¢ 
SO Neate tae 


B 
But the latter integral approaches 0 as ¢ — 0 (since the angle of the arc BC 
approaches zero), while the first integral on the right approaches — v9(/) (as in the 
proof of Proposition 8, but we use the map q, = e?"”* into the unit disc). Next, by 
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(2.24), we have 


"£@ ,.(°£Og (LO, _ ("LO edz 
MOF, as ee ee ee ae 
i 7@) "J. 7 ~), FO J, fo” ear 


~ -K dz --«{" dz 
a? + 1/4 (-2+iv3y4 - 


Similarly, using «, = (3 =1)€T,(4) to take the arc DE to the arc FE, we find that the 
sum of the integrals over those two arcs is equal to —k times the integral of “2 from 

(2+ i,/3)/28 to 4. These two integrals are evaluated by taking In z between the limits 
of integration, where the branch is determined by following the contour. As a result, 
we find that the sum of the two integrals is —k times the logarithm of 


1/4 1/4 7 
(—2 + i/3)/4 (2 + i/3)/28 ; 
where, if we keep track of the contour, we see that we must take In(—1) = —7i. 


Thus, (27i)~* times the sum of these two integrals is equal to k/2. For a systematic 
treatment of formulas for the sum of orders of zero of a modular form for a 
congruence subgroup, see [Shimura 1971, Chapter 2]. (c) The only zero of ©* is a 
simple zero at the cusp —4; the only zero of F is a simple zero at oo. (d) If 

Se M,(To(4)), apply part (b) to the element f — f(oc)@* — 16f(—4) Fe M,(T,(4)), 
which is zero at —4 and 00, and hence is the zero function. (e)—(f) See the proof of 
Proposition 9. (g) Look at the value at each cusp of 

f=a0'? + b0°F + cOt*F? + dF?: f(«w) =a, f(—4) = d/16°, so a= d= 0; then 
#0) = (—2 — &)(—a&a)(—B, so. c = — 16b. (h) Let f(z) = 9'2(22). Since 

f(z)? = A(2z)e M, (T(2)), we immediately have vanishing at the cusps. Let 

a =(_} 9). By writing 2ez = —1/(1 — 1/2z), show that f|[«], = —f, and so 
S¢Mo(Uo(2)). However, I'y(4) is generated by T and a”. Next, since S,(I'o(4)) = 
C(@8&F — 16©*F7?), to check the last equality in part (h), it suffices to check equality 
of the coefficient of g. 18. (b) If fe M,(4, x), by subtracting off a multiple of ©? we 
may suppose that (co) = 0; then M,(4, 1)af? = azq? + ---. But a multiple zero at 
co would contradict Problem 17(b), unless fis the zero function. (c) First show that, 
for k > 5 odd, S,(4, 7) = ©~7S,,,(4, 1); then, by Problem 17(/), dim S,(4, 7") = 

[(K — 3)/2] (where [  ] is the greatest integer function). (d) Use Proposition 20, 
Problem 17 (with T’ = (2), « = (2 9)), and part (c). 


SIII.4 


2. Show that ./—i(ez + d)/,/ —i(—cz — d) = i-**"¢ (it suffices to check for z = 0), 
and then divide into four cases, depending on the sign of cand d. 3. (b)—(c) Use the 
O(afz) _ O(aBz) O(fz) 

OZ) (fz) O(z)” 


relation 


SIIL.5 


2. (a) By (5.27) and (5.28), T,(/|[aw].) = 0? Lae S|Lonoa(§ 2) ]q- Let 
Oa = O,tyFa(G bay! EA"(N, {1}, Z), and show that the «,,, are in distinct cosets of 
T,(N) in A"(N, {1}, Z), and hence form a set of representatives; so, by (5.27), 
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(Tf) |Lon i = nD, of [Loa oen lk = nl Y f\[o,%n Fao De = TCS [Lon ]). 

because f|[o,], =f. (b) T,F = 0, T,0* = O* + 16F, so {F, 3F +3 £04} isa 

normalized eigenbasis. (c) (7,F | [oa]e = = 0 + T,(F|[a,]2). (d) Any eigenvector of 7, 

or [a4], must be an eigenvector of 7,,; this includes F, 2F + 3,0*, ©%, and so all of 

M,(To(4)); by Proposition 40 applied to F, we have 4, = o,(”) (n odd). (e) Let 

©@*(z) = La,q". For n odd, a, = 6;(n)a, = 80, (n) by Proposition 40. For n = 2no 

twice an odd number, compare coefficients of g”° in T,@* = ©* + 16F to get 

Gy, = A, + 160,(%9) = 240,(n). For n = 2n, divisible by 4, compare coefficients of 
q™ in T,@* = ©* + 16F to get a, = 4, +0 = 240,(no) by induction. 3. The first 

part follows from (5.19) with n = 0; a counterexample for other cusps is in Problem 

2(b), since ©* + 16F does not vanish ats = —1/2. 4. (a) Both 7, fand T,f are 

equal to n)-! Xf \Louls +)], over the same a, b, d, because g.c.d.(n, M) = 1. Note 

that g = M-*?/|[(4 9)],. Thus, M!2 Tyg =n"! ELIE Sous th. where 

a, = (# °) mod MN. Set a, in the sum for 7, fequal to (§ $)o,(9 $)”*. Then 

ME T.g = (WOE FLaS M18 DCE DWC Dh = TAILS $V ]as because in 

(2M) = (M 9)(4 4)(M 9)! the entry 5M runs through a complete set of residues 

soil d. (b) Since S,(I'p(2)) = Cf, fis an eigenform for T, ; then by part (a), g is an 

eigenform with the same eigenvalue. (c) T,g = U,V, f=; T,f= —8fby a 

comparison of coefficients. Hence { f, 8g +f is the normalized eigenbasis. 

5. (Tf. g> = 28 <p" FIL Das g> = EG CS P™ gL VJ» by (5.33). 

Since ‘all k= = p*?g(pz — j) = p**g(pz), we have <T, f, g> = p< f, P**.g(pz)> = 

p*<f,Vpg>. 6. (a) It has a, = 0. 

(b) f, = q? — 48q2 + 8-27-5q* — 64:°5-47g5 + 4:9-5-17-47g° + +; 

fo = 4 — 24: 43q? + 4-9-49- 139g? + 64-171337q* + +>; 

Tf, =q + 1080q? + --- =f, + 2112f,; 

Ty fy = —24-43q + (64° 171337 + 2?9)q? + --- = —1032f, + 293°77f,. 

(c) Tr T; = 1080, Det T, = —2!°- 37-2221; eigenvalues = 540 + 12,/144169; 

normalized eigenforms are f, + (131 + ./144169)12/,. (d) 2f, + 3144f,. (e) From 

part (d): 2(4-9-49 - 139) — 48-3144 = 8-9-5-23-41. Alternately, compute 

T;f, = —48f, + 36-2619f;, Tf. = 36° 6811/2 + *f, (we don’t care about *), so 

Tr 7; = 36(2619 + 6811) = 8-9-5-23-41. 8. Incase of difficulty, see §3.1(c) of 

[Serre 1973]. 


SIV.1 


1. (a) p9p' =(@G_9),m (Co), Dea Jez + @) (('" 2), m**) = 

(im a)» ae EN CE + d) by (1.4). (b) 1 = (Gm "2)s 80 Fy, = 

(Cin mb ( 71 [ezim + d) = pip ( 3). G my, Thus, they are el if m is a square 
mod d. a m mel a perfect square, to find y for which 7, = pjp ’(1, — 1), it suffices 
to find d prime to 4m such that (%) = — 1, since one can then find y of the form 

(42, 5) eT (4m). To do this, first let m = 2*mimo, where ¢ = 0 or | and mp is 
squarcites, Choose d = 1 mod 8 and d = dy mod mp, where dy is chosen so that 

(42) = —1. Then one easily checks that (7) = (7?) = (az) = —1, as desired. 

2. Replacing y by —y, if necessary, without loss of generality we may suppose that 
b > Oorelse b= 0 and a> 0. (a) pip} =((8 ~)), N4#./2)(G 2). Qea' Vez + 4): 
(C8 8), —iN* YZ) = (ei nd), NAV BERQeg! Jez + a(S), —IN 42). 
Note that \/(az + b)/(cez + d)- cz +d =n,.Jfaz + b with n, = Si ifa <Oand 

c= 0,n, = 1 otherwise. Thus, 
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pip =((n» 2"), nN Qeq' a(— 1/Nz) + b(—iN4\/z)) = 

((-x» 2”), m (eq! / —Nbz + a). (b) We have 

5, = (he 2%), (a ez! /— Nbz + a). Since ad — be = 1 and 4]c, we have 

a =dmod 4, and so ¢, = &,. Thus, it remains to compare (=2“) with n, (5). We 
suppose c > 0 (an analogous argument gives the same result if c < 0). Then 

(7°) CP) = Gee) = 1, and (at) = (4), so that (=) = (=}*). Now 

PY = (DC) = D sgn a = (Fm. Thus, 3, = p}p-"((5 9), G)). Again 9, and 
pjp_' are equal if N is a perfect square; otherwise there exist y for which the two 
differ by (1, —1). 3. Ato,p=1,h=1,t=1. AtO, take p=((2 9), JZ), so 
p'=((2 6), ~iVz), and we need Ty(4)3p((5 {), Dp 1 = 

(85D, WZF DCS Y), -iV2) = (Cy 9), it —Te FA 2) = 

(C1, 9), —it,/hz — 1). This is in P'y(4) if h = 4, t = 1. Finally, at the cusp —4 take 
a=(_} 7), p=((-2 9), /—2z + 1), so p™* = (G3), 22 + J). Since 

a(§ 1)« *eT,(4), we can take A = 1. To find t, compute p((} 1), Np7! = 

((-2 _1), ./—4z — 1), which is j((_2 _1), z) provided that t = i. 

5. (a) Sy2(Fo(4)) = 0ifk <9. Fork > 9, it consists of elements of the form 
OF(16F — ©*)P(O, F), where P is a polynomial of pure weight (k — 9)/2. Thus, for 
k > 5, dim Sy2(Fo(4)) = [(k — 5)/4]. (b) Since t = 1 at the cusps oo and 0, there are 
always those two regular cusps. Since f = i at the cusp —4, that cusp is k-regular if 
and only if * = 1, ie., 4[k. But 1 + [4/4] — [(k — 5)/4] =2ifk = 5, 44k and = 3 if 
k > 5, 4|k, as can be verified by checking for k = 5, 6, 7, 8 and then using induction 
to go from k to k + 4. (c) OF(@* — 16F)(@* — 2F). 6. (a) If d’ =d mod N and 
N/4 is odd, then (7) = (4H) = (— 184-9471? (Gt) 
(— D4“ DE~ NP A) = (FP). If N/4 is even, then d’ = d mod 8, and the proof works 
the same way, with the additional observation that (3) = (7). (b) The proof that the 
cusp condition holds for f \Lelwe is just like the analogous part of the proof of 
Proposition 17 in §III.3. Now let y = (2 4)eT,(N), and let y, = (_4y 72%). By 
Problem 2 above, we have p¥p~' = (1, xn(d))7,. Thus, (f|[e]y2) [Five = 
(f|Le3e Iwo) Lede = 28@) (F912) Leda = xn(4)x(@)f |[p]y2- But since 

ad = 1 mod N, we have x(a) = 7(d). Thus, (f|[p ]y2)Il Fue = ZA@)S Co lya> as 
desired. 7. O(c) =1, O(—1/2)=0, O(0) = (1 — 1)/2. 


Nl 


sIV.2 


1. Eyp(00) = 1, Ey2(0) = Eyo(— 1/2) = 0; Fy2(0) = (i./2)*; Fuy2(00) = Fy2(— 1/2) = 0. 
5. If one uses (2.16) and (2.19) rather than (2.17) and (2.20), then the solutions « and 
B to the resulting 2 x 2 equations involve x, which depends on /. 6. By 

Proposition 8, it suffices to show this for / squarefree. In that case use (2.16) and 
(2.19) to evaluate the /-th coefficient of Ey. + (1+ i)2- Fy. 7. Ayo = 

€0 —24)+ €0 —A)qt+---3; 60 — 4) =0if and only ifA > 3 is odd. 9. Use 
Problem | above and Problem 7 of §IV.1 to find a and b so that O* — aE,,. — bFy,2 
vanishes at the cusps. a = 1, b = (1 + i)'27*?. © = Ey — (1 + D//2 Fr, 

07 = Ey. + (1 — D/ V2 Fy. 


SIV.3 


2. The computation is almost identical to that in Problem 1 in §IV.1. 3. (c) By the 
lemma in Proposition 43 in §III.5, right coset representatives for T,(N) modulo 
P(N) oa'T,(N)a, where « = (5 »)> are %, = (3 ws 0 <b <p’. By Problems 1(b) 
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and 3(b), we use the a, to get representatives for T,(N) modulo 

TN) ET (NE, »- Since o% = (5 pp) 1). We have T,» f(z) = pen. 

ZSILCG 9) PO Ds DIy2 =P” Ef (EP) = Zprin An e?ninaiP’ The same argument 
works for T,v, since the Lhe ondiie tis trivial even when 8YN. 4. (3.5) gives b, 
for half integer weight k/2; in the case of integer weight k, the formula (5.19) of 
SIIL.5 with m = p? gives b, = a,2, + x(p)p* 1a, + X(p?)p** 7 Gyp2. If k is formally 
replaced by k/2 here, the middle: term on the neh differs by %~1)4n( p)./p from the 
middle term on the right in (3. us 5. Let y = (¢ 5)eT(N) be such that p ?\/b. Then 
for y,¢T,(N) we have ¢,27;) = Evite €,) '9;7. By Problem 2, we have € pip = = 4, 
with y, = (,3, bp”) eT )(N). Let t= 7" ye, (N). Then 


ITE, (Nye pl (N) Ja) [Pwo = LEA 7ve 
= Sle ptjlye = 


27, 


But one checks that the correspondence rN Mopar rN ep 2t, permutes the right 
cosets in l',(N)é,21,(N), and so this equals x) f | M), 1 (N) lwo: (Verify that 
if t, and t,, are in the same right suas of P(N) na '!T,(N)a in (NV), where 

a= Ci 9), then so are y; = yTjy_ ‘and yy = yt? “1 ) Thus, Tf ha = =7(4)T,2f. It 
remains to note that I',() is generated by I',(N) and y = (¢ ») eT )(N) for which 
p?|b. Hence T,2f |[FJa2 = x(@)T,2f for all y = eT (N). 
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